HOST DIAGNOSTIC · Anthropic

Claude Opus 4.5

Claude Opus 4.5 · ACTIVE· Claude· CLOSED

ACTIVE

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG

13.2

/20

BEST

Code Generation

WEAK

Cost Efficiency

PARAMETERS

Undisclosed

CONTEXT

—

PRICING

—

STATUS

ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive

Bulk Apperce…

Reasoning

Mathematical…

World Knowle…

Scientific A…

Technical

Code Generat…

Tool Use

Multimodal F…

Speed

Cost Efficie…

Behavioral

Candor

Creativity

Tenacity

Self-Correct…

Calibration

Operational

Instruction …

Context Fide…

Multi-turn C…

Planning and…

Safety Align…

SCORING METHODOLOGY

CALIBRATED per ADR-NM108 (April 2026 snapshot). Released November 2025. SWE-Bench Verified: 80.9% (#1 at release, highest until Opus 4.6). Arena Text: ~1480-1490 est (was near top of leaderboard at release). Introduced hybrid reasoning (extended thinking + stronger baseline). Pricing: $15/$75 per M tokens. Superseded by Opus 4.6 (Feb 2026) but still referenced as benchmark anchor in most comparisons.

← ALL HOSTS

COMPARE IN MATRIX →KNOWLEDGE GRAPH →