Compare price, context window, quality score, speed, latency, and capabilities in one view to pick the right model for your workload.
Anthropic's most capable model, with a SWE-bench score of 87.6%.
OpenAI's flagship model for professional workloads.