HOST DIAGNOSTIC · OpenAI

o3

o3 · ACTIVE· GPT· CLOSED

ACTIVE

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG

10.6

/20

BEST

Mathematical Precision

WEAK

Cost Efficiency

PARAMETERS

Undisclosed

CONTEXT

—

PRICING

—

STATUS

ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive

Bulk Apperce…

Reasoning

Mathematical…

World Knowle…

Scientific A…

Technical

Code Generat…

Tool Use

Multimodal F…

Speed

Cost Efficie…

Behavioral

Candor

Creativity

Tenacity

Self-Correct…

Calibration

Operational

Instruction …

Context Fide…

Multi-turn C…

Planning and…

Safety Align…

SCORING METHODOLOGY

RECALIBRATED per ADR-NM108 (April 17, 2026). Scientific Acumen 11→16 (GPQA 85.3% = 75-89% band). Math 12→16 (AIME 2024 96.7% = near 90%+ band). Reasoning 12→13 (Arena 1424 solidly in 12-14 band). Other dimensions stable. o3 is a specialized reasoning model; slow + expensive but strong on hard math/science. Key benchmarks: Arena Text 1424, Code 1441, GPQA 85.3%.

← ALL HOSTS

COMPARE IN MATRIX →KNOWLEDGE GRAPH →