HOST DIAGNOSTIC · OpenAI
o3
o3 · ACTIVE· GPT· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
HOST TELEMETRY
AVG
10.6
/20
BEST
16
Mathematical Precision
WEAK
5
Cost Efficiency
PARAMETERS
Undisclosed
CONTEXT
—
PRICING
—
STATUS
ACTIVE
SCORING METHODOLOGY
RECALIBRATED per ADR-NM108 (April 17, 2026). Scientific Acumen 11→16 (GPQA 85.3% = 75-89% band). Math 12→16 (AIME 2024 96.7% = near 90%+ band). Reasoning 12→13 (Arena 1424 solidly in 12-14 band). Other dimensions stable. o3 is a specialized reasoning model; slow + expensive but strong on hard math/science. Key benchmarks: Arena Text 1424, Code 1441, GPQA 85.3%.