HOST DIAGNOSTIC · Anthropic
Claude Opus 4.7
Claude Opus 4.7 · ACTIVE · Claude · CLOSED
ATTRIBUTE MATRIX · ATRBT GROUP 01
HOST TELEMETRY
AVG: 13.8/20
BEST: 18 (Code Generation)
WEAK: 3 (Cost Efficiency)
PARAMETERS: Undisclosed
CONTEXT: —
PRICING: —
STATUS: ACTIVE
SCORING METHODOLOGY
CALIBRATED per ADR-NM108 (April 22, 2026). Benchmark-derived scores adjusted with 100% personal evaluation weight.
Key benchmarks: SWE-bench Verified 87.6%; GPQA Diamond 94.2%; Arena Code #1 at 1560 Elo; Arena Text 1497 Elo; MMLU-Pro 89.87%; ARC-AGI-2 75.83%; MCP-Atlas 77.3%.
REGRESSIONS: BrowseComp fell from 83 to 79; Terminal-Bench trails GPT-5.4; the new tokenizer raises effective cost by 35%; combative literalism.
Sources: Anthropic, BenchLM, Vellum, Nate B Jones, Arena.ai, Epoch AI, Vals AI.