Beta
Leaderboard/Claude Sonnet 4
Anthropic

Claude Sonnet 4

by Anthropic 路 Released 2025-05-22

36.9
avg score
$3.00/1M
Input Price
$15.00/1M
Output Price
200K tokens (~100 books)
Context Window
multimodal
Type

Tested on 20 benchmarks with 36.9% average. Top scores: MATH level 5 (84.4%), GPQA diamond (72.3%), OTIS Mock AIME 2024-2025 (71.1%).

Benchmark Scores

BenchmarkCategoryScoreBar
MATH level 5math84.4
GPQA diamondknowledge72.3
OTIS Mock AIME 2024-2025math71.1
SWE-Bench Verified (Bash Only)coding64.9
Aider polyglotcoding61.3
DeepResearch Benchknowledge47.8
Fiction.LiveBenchknowledge46.9
WeirdMLcoding46.1
OSWorldagentic43.9
ARC-AGIreasoning40.0
GeoBenchknowledge37.0
Cybenchcoding35.0
SimpleBenchreasoning34.6
The Agent Companyagentic33.1
ARC-AGI-2reasoning5.9
GSO-Benchcoding4.9
FrontierMath-2025-02-28-Privatemath4.1
HLEknowledge3.1
VPCTknowledge1.0
FrontierMath-Tier-4-2025-07-01-Privatemath0.1

Similar Models