Model Catalog · Updated May 2026
AI Models 34 models from 14 providers · Prices shown per million tokens · 12 new models added
AMA Smart Routes · Built by Us
One endpoint that picks the right model Use nexus/* routes and let our system automatically select the optimal model for each task — no model-switching overhead.
✦ Smart Auto
nexus/auto
Smart routing — picks the best quality/cost balance among all live models for your request.
✦ Smart Routing
nexus/cheapest-west
Routes to the cheapest capable Western provider model — balanced capability, speed, and cost.
◈ Lowest Cost
nexus/cheapest
Always routes to the cheapest capable model for your task. Great for high-volume workloads.
◉ China Optimized
nexus/cheapest-cn
Cheapest routing via China mainland servers — ultra-low latency for CN users.
⬡ Speed + Quality
nexus/best-reasoning
Optimizes for both response speed and output quality. Best for production chat apps.
▸ Ultra-Fast
nexus/fastest
Always picks the fastest available model. Ideal for real-time streaming and low-latency apps.
⌘ Best for Code
nexus/best-coding
Routes to the strongest coding-capable model available. Best for refactors, reviews, and complex logic.
⬚ Long Context
nexus/long-context
Routes to the cheapest model with 500K+ context. Ideal for long documents and large codebases.
◎ Web Search
nexus/web-search
Routes to Perplexity Sonar models with live web search built in. Best for research and current events.
Trending now Hy3 Preview+52.1% DeepSeek V4 Pro+44.5% Seed 2.0 Pro+38.4% All Providers OpenAI Anthropic Google DeepSeek xAI (Grok) Alibaba Meta Mistral Cohere Perplexity Moonshot Zhipu AI Tencent ByteDance
All capabilities chat vision code reasoning function calling New in 202634 models shown
Sort: Spotlight Quality Speed Trending
Model Provider
Input /M Output /M Context Quality Speed Weekly Use
GPT-5.5 Pro NEW Most Powerful
OpenAI's most powerful frontier model for the most complex tasks
chat reasoning vision codeOpenAI
$34.50
$207.00
200K
Claude Opus 4.7 NEW Latest
Anthropic's most capable model — SWE-bench 87.6%, 3× vision (Apr 2026)
chat vision code reasoningAnthropic
$5.75
$28.75
1M
GPT-5.5 NEW Latest
OpenAI's flagship model for professional workloads (Apr 2026)
chat reasoning vision code function callingOpenAI
$5.75
$34.50
200K
Gemini 3.1 Pro NEW Latest
Google's most capable model, 2M context, multimodal (Feb 2026)
chat vision code reasoningGoogle
$2.30
$13.80
2M
DeepSeek V4 Pro NEW 75% OFF
Advanced reasoning, 75% launch discount until May 31 — list price $1.74/$3.48
DeepSeek
$0.500
$1.00
1M
GPT-5.4 Recommended
Current flagship GPT — powerful, 1M context, fast
chat vision code function callingOpenAI
$2.88
$17.25
1M
DeepSeek V4 Flash NEW Cheapest
Fastest DeepSeek model — ultra-cheap for high-volume tasks
DeepSeek
$0.161
$0.322
1M
GPT-5.4 Mini
Cost-efficient GPT-5.4 class model, great for most tasks
chat vision code function callingOpenAI
$0.863
$5.18
1M
Hy3 Preview NEW Hunyuan 3
Tencent Hunyuan 3 (混元 3) — 295B MoE, agent-first, open-sourced Apr 2026
chat reasoning code function callingTencent
$0.190
$0.620
262K
Seed 2.0 Pro NEW Seed 2.0
ByteDance Seed 2.0 flagship — agentic reasoning, multimodal, 256K context
chat reasoning vision code function callingByteDance
$0.470
$2.37
256K
Grok 4.3 NEW Latest
xAI's latest flagship — new #1 on xAI leaderboard (Apr 2026)
chat reasoning vision codexAI
$1.44
$2.88
131K
Claude Sonnet 4.6 Most Popular
Best balance of Claude intelligence and speed — 1M context
chat vision code reasoningAnthropic
$3.45
$17.25
1M
Gemini 3.1 Flash-Lite NEW Best Value
Best price-performance in Gemini family, retains free tier
Google
$0.288
$1.73
2M
Kimi K2.6 NEW Latest
Latest Kimi — strongest reasoning, top-10 by global API usage
Moonshot
$0.630
$3.05
131K
Grok 4.1 Fast Fast
Grok workhorse — fast, cheap, great for high-volume routing
xAI
$0.230
$0.575
131K
Gemini 3.0 Flash
Fast next-gen model, proven stability, free tier available
Google
$0.575
$3.45
1M
DeepSeek R1
Proven reasoning specialist, comparable to o1 at fraction of cost
DeepSeek
$0.630
$2.52
128K
Mistral Large
Top European AI — strong multilingual, GDPR-compliant
chat code function callingMistral
$2.30
$6.90
131K
Sonar Pro Web Search
Real-time web search + reasoning — live internet access
Perplexity
$3.45
$17.25
127K
Qwen3.6 Plus NEW Latest
Latest Qwen — top-5 by global usage, 1M context (Apr 2026)
Alibaba
$0.374
$2.24
1M
Kimi K2.5
Proven stable release — strong at coding and Chinese tasks
Moonshot
$0.690
$2.88
131K
Command R+
Enterprise RAG specialist — retrieval-augmented generation
chat code reasoning function callingCohere
$2.88
$11.50
128K
GPT-5.4 Nano Cheapest GPT
Ultra-fast, ultra-cheap — high-volume tasks and simple queries
OpenAI
$0.230
$1.44
1M
GPT-5
Base GPT-5 — excellent balance of capability and cost
chat vision code function callingOpenAI
$0.720
$5.75
1M
GPT-4o
Proven multimodal model — stable for existing integrations
chat vision code function callingOpenAI
$2.88
$11.50
128K
Sonar
Fast web-grounded answers — great for factual Q&A
Perplexity
$1.15
$1.15
127K
Claude Haiku 4.5 Fast
Near-instant responses, high throughput, full 1M context
Anthropic
$1.15
$5.75
1M
Mistral Small
Lightweight, fast, EU-based for compliance-sensitive use
Mistral
$0.115
$0.345
33K
Llama 4 Maverick Open Source
Open-source multimodal — 512K context, strong for its price
Meta
$0.220
$0.880
524K
Llama 4 Scout
Efficient open-source, long context, ultra-low cost
Meta
$0.092
$0.345
524K
Qwen Max Best Chinese
Alibaba's most powerful model — excels at Chinese tasks
Alibaba
$0.460
$1.38
131K
Qwen Plus
Balanced performance and cost in the Qwen family
Alibaba
$0.092
$0.299
131K
Qwen Turbo
Fastest, cheapest Qwen — high-volume Chinese language tasks
Alibaba
$0.023
$0.069
131K
GLM-4
Flagship Chinese model, symmetric input/output pricing
Zhipu AI
$0.161
$0.161
128K
Prices shown per million tokens Quality/Speed scores from AMA composite benchmark, updated weekly Prompt caching: save up to 90% on repeated inputs Batch API: 50% discount on async workloads
Accepted Payment Methods 256-bit TLS PCI DSS via Stripe Credits never expire Multi-currency
One endpoint for 100+ frontier models. Transparent pricing, pay-as-you-go, WeChat · Alipay · Stripe supported.
Virginia (US East) Shanghai (CN) Singapore (SEA) Frankfurt (EU) Riyadh (ME)
© 2026 ModelAPI. All rights reserved.
0% per-token markup · Official API prices · 5% infra fee only · Credits never expire