Model Catalog · Updated May 2026

AI Models

65 models from 15 providers · Prices shown per million tokens · 36 new models added

AMA Smart Routes · Built by Us

One endpoint that picks the right model

Use nexus/* routes and let our system automatically select the optimal model for each task — no model-switching overhead.

✦Smart Auto

nexus/auto

Smart routing — picks the best quality/cost balance among all live models for your request.

✦Smart Routing

nexus/cheapest-west

Routes to the cheapest capable Western provider model — balanced capability, speed, and cost.

◈Lowest Cost

nexus/cheapest

Always routes to the cheapest capable model for your task. Great for high-volume workloads.

◉CN Provider Optimized

nexus/cheapest-cn

Routes to the cheapest China-based provider models for eligible users in permitted regions.

⬡Speed + Quality

nexus/best-reasoning

Optimizes for both response speed and output quality. Best for production chat apps.

▸Ultra-Fast

nexus/fastest

Always picks the fastest available model. Ideal for real-time streaming and low-latency apps.

⌘Best for Code

nexus/best-coding

Routes to the strongest coding-capable model available. Best for refactors, reviews, and complex logic.

⬚Long Context

nexus/long-context

Routes to the cheapest model with 500K+ context. Ideal for long documents and large codebases.

◎Web Search

nexus/web-search

Routes to Perplexity Sonar models with live web search built in. Best for research and current events.

Trending nowSeed3D 2.0 (CN)+70%Seedance 2.0 (CN)+62%Seed3D 2.0 (International)+60%

65 models shown

Sort:

Provider

Use

Claude Sonnet 5NEWLatest

Latest Sonnet — claude-sonnet-latest routes here. Intro pricing through Aug 2026.

chat vision code reasoning

Anthropic

$1.90

$9.50

+45%

Get Key

GPT-5.6 SolNEWGPT-5.6 Flagship

OpenAI GPT-5.6 flagship — highest intelligence in the 5.6 family

chat reasoning vision code function calling

OpenAI

$5.00

$30.00

+22.4%

Get Key

GPT-5.6 TerraNEWBalanced

GPT-5.6 balanced tier — strong quality at mid price

chat reasoning vision code function calling

OpenAI

$2.50

$15.00

+18.1%

Get Key

GPT-5.6 LunaNEWFast GPT-5.6

GPT-5.6 fast/efficient tier for high-volume workloads

chat reasoning vision code function calling

OpenAI

$1.00

$6.00

+16.5%

Get Key

Claude Opus 4.8NEWLive

Use claude-opus-4-8 or claude-opus-latest — platform routes to ArkLin automatically.

chat vision code reasoning

Anthropic

$4.75

$23.75

+48.2%

Get Key

Claude Opus 4.7BYOK / Soon

Anthropic catalog + BYOK direct API. Opus 4.7 on the US pool rolls out when enabled on the platform.

chat vision code reasoning

Anthropic

$4.75

$23.75

200K

+18%

Get Key

GPT-5.5 ProMost Powerful

OpenAI's most powerful frontier model for the most complex tasks

chat reasoning vision code

OpenAI

$5.00

$30.00

256K

+4.2%

Get Key

GPT-5.5Latest

OpenAI's flagship model for professional workloads (Apr 2026)

chat reasoning vision code function calling

OpenAI

$2.50

$10.00

256K

+18.6%

Get Key

Gemini 3.1 ProNEWLatest

Google's most capable model, 2M context, multimodal (Feb 2026)

chat vision code reasoning

Google

$2.00

$12.00

+9.8%

Get Key

Claude Fable 5NEWRestricted

Anthropic 第五代旗舰 — 上游限制许可，暂不可调用。请用 Sonnet 5 / Opus 4.8。

chat vision code reasoning

Anthropic

$10.00

$50.00

Get Key

DeepSeek V4 ProNEW75% OFF

Advanced reasoning, 75% launch discount until May 31 — list price $1.74/$3.48

chat reasoning code

DeepSeek

$0.500

$1.00

+44.5%

Get Key

GPT-5.4Recommended

Current flagship GPT — powerful, 1M context, fast

chat vision code function calling

OpenAI

$2.50

$10.00

128K

+3.1%

Get Key

DeepSeek V4 FlashNEWCheapest

Fastest DeepSeek model — ultra-cheap for high-volume tasks

chat code

DeepSeek

$0.140

$0.280

+19.3%

Get Key

GPT-5.4 Mini

Cost-efficient GPT-5.4 class model, great for most tasks

chat vision code function calling

OpenAI

$0.150

$0.600

128K

-1.2%

Get Key

Hy3 PreviewNEWHunyuan 3

Tencent Hunyuan 3 (混元 3) — 295B MoE, agent-first, open-sourced Apr 2026

chat reasoning code function calling

Tencent

$0.190

$0.620

262K

+52.1%

Get Key

Seedance 2.0 (CN)NEWVideo · CN

Volcengine Ark CN — text/image/video-to-video, up to 15s, 720p/1080p. API: POST /v1/contents/generations/tasks

video

ByteDance

$0.0000

+62%

Get Key

Seedance 2.0 Fast (CN)NEWVideo · CN

Faster CN route for prompt iteration — 720p

video

ByteDance

$0.0000

+48%

Get Key

Seedance 2.0 (International)NEWVideo · Intl

BytePlus ModelArk international region — same API, base URL routed automatically

video

ByteDance

$0.0000

+55%

Get Key

Seedance 2.0 Fast (Intl)NEWVideo · Intl

International fast variant for draft renders

video

ByteDance

$0.0000

+44%

Get Key

Seed3D 2.0 (CN)NEW3D · CN

Volcengine Ark CN — image/text-to-3D asset generation. API: POST /v1/contents/generations/tasks

ByteDance

$0.0000

+70%

Get Key

Seed3D 1.0 (CN)NEW3D · CN

Seed3D 1.0 — image/text-to-3D (China region)

ByteDance

$0.0000

+45%

Get Key

Grok 4.3NEWLatest

xAI flagship — 1M context, configurable reasoning effort

chat reasoning vision code

xAI

$1.25

$2.50

+17.9%

Get Key

Claude Sonnet 4.6Most Popular

Best balance of Claude intelligence and speed — 1M context

chat vision code reasoning

Anthropic

$2.85

$14.25

+11.4%

Get Key

Gemini 3.1 Flash-LiteNEWBest Value

Best price-performance in Gemini family, retains free tier

chat vision code

Google

$0.250

$1.50

+31.2%

Get Key

Qwen3.7 MaxNEWLatest

Alibaba Qwen3.7 flagship — Bailian verified, 1M context

chat code reasoning function calling

Alibaba

$0.800

$3.20

+32%

Get Key

GLM-5.1NEWLatest

Zhipu GLM-5.1 — Bailian/ArkLin verified live

chat code reasoning

Zhipu AI

$1.00

$3.20

200K

+30%

Get Key

Kimi K2.7 CodeNEWLatest

Moonshot coding flagship — 256K context, best for agentic code

chat code reasoning

Moonshot

$0.950

$4.00

256K

+22.4%

Get Key

Grok 4.1 FastLegacy

Legacy slug — xAI redirects to Grok 4.3 (none reasoning effort)

chat code

xAI

$1.25

$2.50

+6.3%

Get Key

Gemini 3.0 Flash

Fast next-gen model, proven stability, free tier available

chat vision code

Google

$0.575

$3.45

-4.1%

Get Key

DeepSeek R1

Reasoning alias → V4 Pro (replaces deepseek-reasoner)

chat reasoning code

DeepSeek

$0.550

$2.19

-6.3%

Get Key

Mistral Large

Top European AI — strong multilingual, GDPR-compliant

chat code function calling

Mistral

$2.00

$6.00

131K

-1.8%

Get Key

Sonar ProWeb Search

Real-time web search + reasoning — live internet access

chat reasoning

Perplexity

$3.00

$15.00

127K

+7.2%

Get Key

Qwen3.7 PlusNEWNew

Qwen3.7 plus — strong balance of quality and cost

chat code reasoning function calling

Alibaba

$0.400

$1.60

+26%

Get Key

Qwen3.6 Plus

Qwen3.6 long-context — still widely used

chat code reasoning

Alibaba

$0.325

$1.95

+12.4%

Get Key

Kimi K2.6

Multimodal general model — vision, thinking, and agents

chat code reasoning vision

Moonshot

$0.550

$2.65

256K

+15.8%

Get Key

Command R+

Enterprise RAG specialist — retrieval-augmented generation

chat code reasoning function calling

Cohere

$2.50

$10.00

128K

-3.5%

Get Key

GPT-5.4 NanoCheapest GPT

Ultra-fast, ultra-cheap — high-volume tasks and simple queries

chat code

OpenAI

$0.230

$1.44

+2.4%

Get Key

GPT-5

Base GPT-5 — excellent balance of capability and cost

chat vision code function calling

OpenAI

$0.720

$5.75

-3%

Get Key

GPT-4o

Proven multimodal model — stable for existing integrations

chat vision code function calling

OpenAI

$2.88

$11.50

128K

-8.4%

Get Key

Sonar

Fast web-grounded answers — great for factual Q&A

chat

Perplexity

$1.00

127K

+3.1%

Get Key

Claude Haiku 4.5Fast

Near-instant responses, high throughput, full 1M context

chat vision code

Anthropic

$0.950

$4.75

200K

+4.7%

Get Key

Mistral Small

Lightweight, fast, EU-based for compliance-sensitive use

chat code

Mistral

$0.115

$0.345

33K

-0.4%

Get Key

Llama 4 MaverickOpen Source

Open-source multimodal — 512K context, strong for its price

chat vision code