Better routing · transparent pricing · no subscriptions
Change two lines — base URL and API key — to connect. A small infra fee is added when you purchase credits (default 5%); usage deducts against provider list pricing.
Trending This Week
Based on real request volume over the past 7 days · updated hourly
Claude Code stays on native Opus 4.7 for planning and reasoning. ModelAPI nexus/cheapest-west handles high-volume subtasks cheaply. Blended output cost: $2.20/M — just 9% of the all-Opus price.
Sound familiar?
Account bans, unstable connections, payment walls — we eliminate all three.
Overseas phone + credit card required. Accounts banned without warning; balance lost.
✓ Our solution
Sign up with any email. No foreign card, no ban risk. Your credits never expire.
Direct overseas connections drop constantly. VPNs add latency and compliance risk.
✓ Our solution
US primary launches first; APAC and EU regional BASE_URLs enable as PoPs ship.
Foreign card only. No RMB payment, no VAT invoice, no corporate billing.
✓ Our solution
WeChat Pay, Alipay, bank transfer in RMB. VAT invoices available. Corporate billing supported.
Ten keys, inflated bills, flaky routing — condensed into one control plane.
OpenAI, Anthropic, Google, DeepSeek — each with its own account, billing, key rotation, and quota tracking.
Per-token API billing adds up. Some gateways add markup, inflate counts, or swap models.
Trading cost vs quality manually breaks unattended workflows.
Non-developers get stuck; ticket cycles are long.
Same credential surface for experimentation and scaled traffic.
GPT-5.5 Pro, Claude Opus 4.7, Gemini 3.1 Pro, Grok 4.3 — same integration surface.
DeepSeek V4 Flash, Qwen, Kimi, GLM — built for volume and everyday workloads.
Representative routing: code pane, streamed body, latency and credits.
Loading...Ecosystem
Point Cursor, Windsurf, VS Code, Claude Code, or Codex CLI at ModelAPI.
Universal config — works in every tool
Resilience
Reuse the exact same key — swap hostname only.
Catalog
50+ frontier and value tiers · one unified API envelope
Capabilities
One envelope for frontier models and value tiers — not another dashboard maze.
GPT-5.5, Claude Opus 4.7, Gemini 3.1, DeepSeek V4, Grok 4.3 and 50+ more — all through one API key, no restrictions.
Change 2 lines of code. Keep your existing SDK, prompts, and tools — zero refactoring.
US · Singapore · Tokyo · Frankfurt. Regional PoPs roll out after US primary launch.
Official API prices, zero markup. 5% infrastructure fee at top-up only — below industry average. Bring Your Own Key (BYOK) at $0.50/M for expensive models.
Key isolation · RBAC · Audit logs · Spend caps · SOC 2 compliant infrastructure.
Automatic provider fallback. Route by cost, latency, or capability. Model group aliases included.
Start in under 60 seconds
Top up from $5. Supports WeChat, Alipay, Stripe, Apple Pay, crypto.
View payment optionsCreate a key, swap the base URL. Your existing OpenAI code works as-is.
Read quickstartPricing
Upstream model passes through with a spelled-out infra surcharge — zero surprise fees.
For exploration and testing
Official API prices · zero markup · min $5 top-up
For production workloads
Loved by developers
"Used to juggle 7–8 API accounts. Now one ModelAPI key covers everything, billing is crystal clear, and switching between Claude Opus and DeepSeek takes zero effort."
"The Singapore PoP is legitimately fast. I was skeptical about a gateway adding latency, but median response time is actually lower than hitting the OpenAI API directly from SEA. Solid routing."
"China access was always our headache. ModelAPI domestic endpoints solved it, and WeChat Pay means no more fighting with foreign credit cards. Deployed to production in one afternoon."
"The PPP pricing for India is real — I'm paying ~40% less than teammates in the US for the same models. Switching my existing codebase took literally 2 lines of code."
"Looked at a lot of gateways — ModelAPI is one of the few with truly 0% per-token markup and a transparent fee structure. The nexus/cheapest route cut our batch inference cost substantially."
"Building multi-tenant where users have different model preferences. The Compare page saved hours of manual benchmarking. BYOK at $0.50/M flat is the most cost-effective routing I've found."
Recent Updates
Now routing via nexus/cheapest. $0.16/M input, 1M context window.
Pause agentic workflows mid-execution and inject human decisions via the API.
Prompt-level cache hits on repeated context now reduce latency by ~40% at no extra cost.
Sign up in 30 seconds. No credit card required. 50+ models, one key.