AI Models

34 models from 14 providers · Prices shown per million tokens · 12 new models added

AMA Smart Routes · Built by Us

One endpoint that picks the right model

Use nexus/* routes and let our system automatically select the optimal model for each task — no model-switching overhead.

Smart Auto
nexus/auto

Smart routing — picks the best quality/cost balance among all live models for your request.

Smart Routing
nexus/cheapest-west

Routes to the cheapest capable Western provider model — balanced capability, speed, and cost.

Lowest Cost
nexus/cheapest

Always routes to the cheapest capable model for your task. Great for high-volume workloads.

China Optimized
nexus/cheapest-cn

Cheapest routing via China mainland servers — ultra-low latency for CN users.

Speed + Quality
nexus/best-reasoning

Optimizes for both response speed and output quality. Best for production chat apps.

Ultra-Fast
nexus/fastest

Always picks the fastest available model. Ideal for real-time streaming and low-latency apps.

Best for Code
nexus/best-coding

Routes to the strongest coding-capable model available. Best for refactors, reviews, and complex logic.

Long Context
nexus/long-context

Routes to the cheapest model with 500K+ context. Ideal for long documents and large codebases.

Web Search
nexus/web-search

Routes to Perplexity Sonar models with live web search built in. Best for research and current events.

Trending nowHy3 Preview+52.1%DeepSeek V4 Pro+44.5%Seed 2.0 Pro+38.4%

34 models shown

Provider
Use
GPT-5.5 ProNEWMost Powerful
OpenAI's most powerful frontier model for the most complex tasks
OpenAI
$34.50
$207.00
200K
98
62
+4.2%
Claude Opus 4.7NEWLatest
Anthropic's most capable model — SWE-bench 87.6%, 3× vision (Apr 2026)
Anthropic
$5.75
$28.75
1M
97
68
+22.1%
GPT-5.5NEWLatest
OpenAI's flagship model for professional workloads (Apr 2026)
OpenAI
$5.75
$34.50
200K
95
74
+18.6%
Gemini 3.1 ProNEWLatest
Google's most capable model, 2M context, multimodal (Feb 2026)
Google
$2.30
$13.80
2M
94
72
+9.8%
DeepSeek V4 ProNEW75% OFF
Advanced reasoning, 75% launch discount until May 31 — list price $1.74/$3.48
DeepSeek
$0.500
$1.00
1M
90
70
+44.5%
GPT-5.4Recommended
Current flagship GPT — powerful, 1M context, fast
OpenAI
$2.88
$17.25
1M
91
81
+3.1%
DeepSeek V4 FlashNEWCheapest
Fastest DeepSeek model — ultra-cheap for high-volume tasks
DeepSeek
$0.161
$0.322
1M
78
95
+19.3%
GPT-5.4 Mini
Cost-efficient GPT-5.4 class model, great for most tasks
OpenAI
$0.863
$5.18
1M
84
88
-1.2%
Hy3 PreviewNEWHunyuan 3
Tencent Hunyuan 3 (混元 3) — 295B MoE, agent-first, open-sourced Apr 2026
Tencent
$0.190
$0.620
262K
89
72
+52.1%
Seed 2.0 ProNEWSeed 2.0
ByteDance Seed 2.0 flagship — agentic reasoning, multimodal, 256K context
ByteDance
$0.470
$2.37
256K
92
68
+38.4%
Grok 4.3NEWLatest
xAI's latest flagship — new #1 on xAI leaderboard (Apr 2026)
xAI
$1.44
$2.88
131K
92
76
+17.9%
Claude Sonnet 4.6Most Popular
Best balance of Claude intelligence and speed — 1M context
Anthropic
$3.45
$17.25
1M
93
78
+11.4%
Gemini 3.1 Flash-LiteNEWBest Value
Best price-performance in Gemini family, retains free tier
Google
$0.288
$1.73
2M
80
94
+31.2%
Kimi K2.6NEWLatest
Latest Kimi — strongest reasoning, top-10 by global API usage
Moonshot
$0.630
$3.05
131K
86
75
+15.8%
Grok 4.1 FastFast
Grok workhorse — fast, cheap, great for high-volume routing
xAI
$0.230
$0.575
131K
79
93
+6.3%
Gemini 3.0 Flash
Fast next-gen model, proven stability, free tier available
Google
$0.575
$3.45
1M
79
91
-4.1%
DeepSeek R1
Proven reasoning specialist, comparable to o1 at fraction of cost
DeepSeek
$0.630
$2.52
128K
87
66
-6.3%
Mistral Large
Top European AI — strong multilingual, GDPR-compliant
Mistral
$2.30
$6.90
131K
83
77
-1.8%
Sonar ProWeb Search
Real-time web search + reasoning — live internet access
Perplexity
$3.45
$17.25
127K
86
71
+7.2%
Qwen3.6 PlusNEWLatest
Latest Qwen — top-5 by global usage, 1M context (Apr 2026)
Alibaba
$0.374
$2.24
1M
88
79
+28.4%
Kimi K2.5
Proven stable release — strong at coding and Chinese tasks
Moonshot
$0.690
$2.88
131K
83
73
-5.2%
Command R+
Enterprise RAG specialist — retrieval-augmented generation
Cohere
$2.88
$11.50
128K
82
73
-3.5%
GPT-5.4 NanoCheapest GPT
Ultra-fast, ultra-cheap — high-volume tasks and simple queries
OpenAI
$0.230
$1.44
1M
77
96
+2.4%
GPT-5
Base GPT-5 — excellent balance of capability and cost
OpenAI
$0.720
$5.75
1M
88
84
-3%
GPT-4o
Proven multimodal model — stable for existing integrations
OpenAI
$2.88
$11.50
128K
82
80
-8.4%
Sonar
Fast web-grounded answers — great for factual Q&A
Perplexity
$1.15
$1.15
127K
77
88
+3.1%
Claude Haiku 4.5Fast
Near-instant responses, high throughput, full 1M context
Anthropic
$1.15
$5.75
1M
83
93
+4.7%
Mistral Small
Lightweight, fast, EU-based for compliance-sensitive use
Mistral
$0.115
$0.345
33K
72
91
-0.4%
Llama 4 MaverickOpen Source
Open-source multimodal — 512K context, strong for its price
Meta
$0.220
$0.880
524K
81
82
+5.6%
Llama 4 Scout
Efficient open-source, long context, ultra-low cost
Meta
$0.092
$0.345
524K
74
89
+2.9%
Qwen MaxBest Chinese
Alibaba's most powerful model — excels at Chinese tasks
Alibaba
$0.460
$1.38
131K
85
76
+1.5%
Qwen Plus
Balanced performance and cost in the Qwen family
Alibaba
$0.092
$0.299
131K
75
84
-2.1%
Qwen Turbo
Fastest, cheapest Qwen — high-volume Chinese language tasks
Alibaba
$0.023
$0.069
131K
68
97
+0.8%
GLM-4
Flagship Chinese model, symmetric input/output pricing
Zhipu AI
$0.161
$0.161
128K
76
85
+1.1%
Prices shown per million tokensQuality/Speed scores from AMA composite benchmark, updated weeklyPrompt caching: save up to 90% on repeated inputsBatch API: 50% discount on async workloads