HOST DIAGNOSTIC · OpenAI

GPT-5.4

GPT-5.4 · ACTIVE· GPT· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
Bulk Apperception
[18]
Reasoning
[15]
Math Precision
[16]
Code Generation
[15]
World Knowledge
[16]
Scientific Acumen
[18]
Instruction Following
[16]
Context Fidelity
[15]
Candor
[14]
Creativity
[15]
Multi-turn Coherence
[15]
Multimodal Fluency
[15]
Tool Use
[17]
Planning
[16]
Self-Correction
[15]
Tenacity
[15]
Safety Alignment
[14]
Calibration
[14]
Speed
[14]
Cost Efficiency
[10]
HOST TELEMETRY
AVG
15.2
/20
BEST
18
Bulk Apperception
WEAK
10
Cost Efficiency
PARAMETERS
Undisclosed
CONTEXT
PRICING
STATUS
ACTIVE
ATTRIBUTE SCORES · 20 DIMENSIONS
Cognitive
Bulk Apperce…
18
Reasoning
15
Mathematical…
16
World Knowle…
16
Scientific A…
18
Technical
Code Generat…
15
Tool Use
17
Multimodal F…
15
Speed
14
Cost Efficie…
10
Behavioral
Candor
14
Creativity
15
Tenacity
15
Self-Correct…
15
Calibration
14
Operational
Instruction …
16
Context Fide…
15
Multi-turn C…
15
Planning and…
16
Safety Align…
14
SCORING METHODOLOGY
RECALIBRATED per ADR-NM108 (April 17, 2026). Prior Gen 1 scores were 16-19; Arena Text 1481 maps to 15-17 band, not 18. Bulk Apperception INCREASED to 18 (AAII 57 = genuine #1 tie). Cost Efficiency dropped to 10 ($2.50/$15 Standard, $30/$180 Pro). Key benchmarks: Arena Text #7 at 1481 → Reasoning 15. Arena Code #7 at 1457 → Code 15. AAII 57 (tied #1) → Bulk 18. Native computer-use capabilities. Three variants: Standard, Thinking, Pro.