HOST DIAGNOSTIC · xAI

Grok 4.20

Grok 4.20 · ACTIVE· Grok· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
Bulk Apperception
[15]
Reasoning
[15]
Math Precision
[15]
Code Generation
[14]
World Knowledge
[16]
Scientific Acumen
[14]
Instruction Following
[14]
Context Fidelity
[13]
Candor
[13]
Creativity
[15]
Multi-turn Coherence
[14]
Multimodal Fluency
[13]
Tool Use
[15]
Planning
[15]
Self-Correction
[14]
Tenacity
[15]
Safety Alignment
[13]
Calibration
[12]
Speed
[12]
Cost Efficiency
[11]
HOST TELEMETRY
AVG
13.9
/20
BEST
16
World Knowledge
WEAK
11
Cost Efficiency
PARAMETERS
Undisclosed
CONTEXT
PRICING
STATUS
ACTIVE
ATTRIBUTE SCORES · 20 DIMENSIONS
Cognitive
Bulk Apperce…
15
Reasoning
15
Mathematical…
15
World Knowle…
16
Scientific A…
14
Technical
Code Generat…
14
Tool Use
15
Multimodal F…
13
Speed
12
Cost Efficie…
11
Behavioral
Candor
13
Creativity
15
Tenacity
15
Self-Correct…
14
Calibration
12
Operational
Instruction …
14
Context Fide…
13
Multi-turn C…
14
Planning and…
15
Safety Align…
13
SCORING METHODOLOGY
RECALIBRATED per ADR-NM108 (April 17, 2026). Prior scores 13-18 lacked benchmark provenance. No Arena ELO confirmed at time of recalibration; scores estimated from competitive position between Grok 4 (Arena ~1442) and frontier cluster (~1485+). Likely in 1460-1480 range → 15-17 band, scored conservatively at 15. Creativity kept at 15 (4-agent debate architecture produces distinctive outputs). World Knowledge: 16 (real-time X data is genuinely unique capability). Safety kept low (13) per xAI design philosophy. Cost/Speed penalized for 4-agent overhead.