Agentic AI Benchmarks

Interactive cost-vs-performance comparisons of frontier AI models. Automatically synced from Artificial Analysis.

Cost Basis

Blended price per 1M tokens, assuming a 7:2:1 ratio of cache-hit, input, and output tokens.
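As a rough illustration of the 7:2:1 cost basis, the sketch below treats the blended price as a weighted average of three per-1M-token prices. That reading of "blended" is an assumption, as are the function name `blended_price` and the example prices, which are hypothetical and not taken from any listed model.

```python
def blended_price(cache_hit_price, input_price, output_price, weights=(7, 2, 1)):
    """Weighted average price per 1M tokens.

    Assumes "blended" means a weighted arithmetic mean, with weights
    given in the order (cache hits, input, output).
    """
    w_cache, w_input, w_output = weights
    total_weight = w_cache + w_input + w_output
    weighted_sum = (
        w_cache * cache_hit_price
        + w_input * input_price
        + w_output * output_price
    )
    return weighted_sum / total_weight


# Hypothetical prices in USD per 1M tokens: cache hit, input, output.
print(round(blended_price(0.30, 3.00, 15.00), 2))  # 2.31
```

With these weights, cache-hit pricing covers 70% of the token mix, so a model with cheap cached input can show a low blended price even when its output tokens are expensive.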

Cost vs. Overall Intelligence

Blended price per 1M tokens vs. Overall Intelligence

Most Attractive: High overall intelligence, low cost

Expensive: High overall intelligence, high cost

Budget: Low overall intelligence, low cost

Avoid: Low overall intelligence, high cost
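The four quadrant labels above can be expressed as a small classifier. This is a minimal sketch: the page does not state where the quadrant boundaries lie, so the cut-off parameters `intelligence_cut` and `cost_cut`, the tie-breaking rule, and the function name `quadrant` are all assumptions made for illustration.

```python
def quadrant(intelligence, cost, intelligence_cut, cost_cut):
    """Map a model to one of the four chart quadrants.

    Assumed rule: a score at or above intelligence_cut counts as high
    intelligence, and a cost strictly above cost_cut counts as high cost.
    """
    high_intelligence = intelligence >= intelligence_cut
    high_cost = cost > cost_cut
    if high_intelligence:
        return "Expensive" if high_cost else "Most Attractive"
    return "Avoid" if high_cost else "Budget"


# Hypothetical model: score 60 at $1.00 per 1M tokens, cut-offs 50 and $2.00.
print(quadrant(60, 1.00, intelligence_cut=50, cost_cut=2.00))  # Most Attractive
```

One plausible choice of cut-offs is the median score and median blended price of the displayed models, which would place roughly half of the models on each side of each axis.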

Recent Releases

Grok 4.5 (high) · SpaceXAI · Jul 8, 2026
Claude Sonnet 5 (max) · Anthropic · Jun 30, 2026
GLM-5.2 (max) · Z AI · Jun 16, 2026
Kimi K2.7 Code · Kimi · Jun 12, 2026
Claude Fable 5 (with fallback) · Anthropic · Jun 9, 2026
Nemotron 3 Ultra · NVIDIA · Jun 4, 2026
MiniMax-M3 · MiniMax · Jun 1, 2026
Qwen3.7 Plus · Alibaba · Jun 1, 2026
Claude Opus 4.8 (max) · Anthropic · May 28, 2026
Gemini 3.5 Flash · Google · May 19, 2026
Qwen3.7 Max · Alibaba · May 19, 2026
Grok 4.3 (high) · SpaceXAI · Apr 30, 2026
DeepSeek V4 Pro (max) · DeepSeek · Apr 24, 2026
DeepSeek V4 Flash (max) · DeepSeek · Apr 24, 2026
DeepSeek V4 Flash (high) · DeepSeek · Apr 24, 2026
GPT-5.5 (xhigh) · OpenAI · Apr 23, 2026
MiMo-V2.5-Pro · Xiaomi · Apr 22, 2026
MiMo-V2.5 · Xiaomi · Apr 22, 2026
Kimi K2.6 · Kimi · Apr 20, 2026
Claude Opus 4.7 (max) · Anthropic · Apr 16, 2026