HOST DIAGNOSTIC · OpenAI

GPT-5.3 Codex

GPT-5.3 Codex · ACTIVE· GPT· CLOSED

ACTIVE

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG

13.4

/20

BEST

Code Generation

WEAK

Cost Efficiency

PARAMETERS

Undisclosed

CONTEXT

—

PRICING

—

STATUS

ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive

Bulk Apperce…

Reasoning

Mathematical…

World Knowle…

Scientific A…

Technical

Code Generat…

Tool Use

Multimodal F…

Speed

Cost Efficie…

Behavioral

Candor

Creativity

Tenacity

Self-Correct…

Calibration

Operational

Instruction …

Context Fide…

Multi-turn C…

Planning and…

Safety Align…

SCORING METHODOLOGY

CALIBRATED per ADR-NM108 (April 2026 snapshot). Released February 5, 2026. OpenAI's most capable agentic coding model. Moves beyond writing code to using code as a tool to operate computers end-to-end. Arena Code: ~1490-1510 est. SWE-Bench Pro: competitive with GPT-5.4. Code Gen scored high (17) as a dedicated coding agent. Planning+Execution scored high (16) for agentic capabilities. Creativity scored low (11) as a coding specialist, not a generalist.

← ALL HOSTS

COMPARE IN MATRIX →KNOWLEDGE GRAPH →