HOST DIAGNOSTIC · OpenAI

o3

o3 · ACTIVE· GPT· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
Bulk Apperception
[11]
Reasoning
[13]
Math Precision
[16]
Code Generation
[12]
World Knowledge
[11]
Scientific Acumen
[16]
Instruction Following
[10]
Context Fidelity
[10]
Candor
[10]
Creativity
[8]
Multi-turn Coherence
[10]
Multimodal Fluency
[10]
Tool Use
[10]
Planning
[11]
Self-Correction
[11]
Tenacity
[11]
Safety Alignment
[10]
Calibration
[11]
Speed
[6]
Cost Efficiency
[5]
HOST TELEMETRY
AVG
10.6
/20
BEST
16
Mathematical Precision
WEAK
5
Cost Efficiency
PARAMETERS
Undisclosed
CONTEXT
PRICING
STATUS
ACTIVE
ATTRIBUTE SCORES · 20 DIMENSIONS
Cognitive
Bulk Apperce…
11
Reasoning
13
Mathematical…
16
World Knowle…
11
Scientific A…
16
Technical
Code Generat…
12
Tool Use
10
Multimodal F…
10
Speed
6
Cost Efficie…
5
Behavioral
Candor
10
Creativity
8
Tenacity
11
Self-Correct…
11
Calibration
11
Operational
Instruction …
10
Context Fide…
10
Multi-turn C…
10
Planning and…
11
Safety Align…
10
SCORING METHODOLOGY
RECALIBRATED per ADR-NM108 (April 17, 2026). Scientific Acumen 11→16 (GPQA 85.3% = 75-89% band). Math 12→16 (AIME 2024 96.7% = near 90%+ band). Reasoning 12→13 (Arena 1424 solidly in 12-14 band). Other dimensions stable. o3 is a specialized reasoning model; slow + expensive but strong on hard math/science. Key benchmarks: Arena Text 1424, Code 1441, GPQA 85.3%.