HOST DIAGNOSTIC · Anthropic

Claude Sonnet 4.6

Claude Sonnet 4.6 · ACTIVE· Claude· CLOSED

ACTIVE

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG

14.0

/20

BEST

Code Generation

WEAK

Multimodal Fluency

PARAMETERS

Undisclosed

CONTEXT

—

PRICING

—

STATUS

ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive

Bulk Apperce…

Reasoning

Mathematical…

World Knowle…

Scientific A…

Technical

Code Generat…

Tool Use

Multimodal F…

Speed

Cost Efficie…

Behavioral

Candor

Creativity

Tenacity

Self-Correct…

Calibration

Operational

Instruction …

Context Fide…

Multi-turn C…

Planning and…

Safety Align…

SCORING METHODOLOGY

RECALIBRATED per ADR-NM108 (April 17, 2026). Prior Gen 1 scores were 11-15; Arena Text 1490 + Code 1523 justified significant upward correction. Key benchmarks: Arena Text #3 at 1490 → Reasoning 15 (was 12). Arena Code #3 at 1523 → Code Gen 17 (was 15). SWE-Bench 79.6% (within 1.2pts of Opus) → validates Code 17. AAII 51 → Bulk 15 (was 12). OfficeQA matches Opus 4.6. Pricing: $3/$15 per M tokens (40% cheaper than Opus). 1M context window (beta).

← ALL HOSTS

COMPARE IN MATRIX →KNOWLEDGE GRAPH →