HOST DIAGNOSTIC · OpenAI

GPT-5.4

GPT-5.4 · ACTIVE · GPT · CLOSED

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG: 15.2 / 20
BEST: Bulk Apperception
WEAK: Cost Efficiency
PARAMETERS: Undisclosed
CONTEXT: not listed
PRICING: not listed
STATUS: ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive: Bulk Apperce…, Reasoning, Mathematical…, World Knowle…, Scientific A…

Technical: Code Generat…, Tool Use, Multimodal F…, Speed, Cost Efficie…

Behavioral: Candor, Creativity, Tenacity, Self-Correct…, Calibration

Operational: Instruction …, Context Fide…, Multi-turn C…, Planning and…, Safety Align…

SCORING METHODOLOGY

RECALIBRATED per ADR-NM108 (April 17, 2026). Prior Gen 1 scores ranged from 16 to 19, but an Arena Text rating of 1481 maps to the 15-17 band, not 18. Bulk Apperception increased to 18 (AAII 57, a genuine tie for #1). Cost Efficiency dropped to 10 (pricing: $2.50/$15 Standard, $30/$180 Pro). Key benchmarks: Arena Text #7 at 1481 → Reasoning 15; Arena Code #7 at 1457 → Code 15; AAII 57 (tied #1) → Bulk 18. The model has native computer-use capabilities and ships in three variants: Standard, Thinking, and Pro.
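As a rough illustration of how the AVG, BEST, and WEAK telemetry fields could be derived from per-dimension scores, here is a minimal Python sketch. It uses only the four scores stated in the methodology note. The remaining 16 dimension scores are not given in the text, so they are omitted, and the function name `summarize` and the dictionary layout are hypothetical, not the site's actual implementation.

```python
from statistics import mean

# Only the four scores stated in the methodology note. The other 16
# dimension scores are not given in the text, so they are left out.
known_scores = {
    "Bulk Apperception": 18,
    "Reasoning": 15,
    "Code": 15,
    "Cost Efficiency": 10,
}


def summarize(scores):
    """Return average, best, and weakest dimension (hypothetical helper)."""
    return {
        "avg": round(mean(scores.values()), 1),
        "best": max(scores, key=scores.get),
        "weak": min(scores, key=scores.get),
    }


summary = summarize(known_scores)
print(summary)
```

On these four values the sketch reports Bulk Apperception as best and Cost Efficiency as weakest, matching the card. Its average (14.5) differs from the card's 15.2 because the card averages all 20 dimensions.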
