HOST DIAGNOSTIC · Google

Gemini 3.1 Pro

Gemini 3.1 Pro · ACTIVE· Gemini· CLOSED

ACTIVE

ATTRIBUTE MATRIX · ATRBT GROUP 01

HOST TELEMETRY

AVG

16.1

/20

BEST

Scientific Acumen

WEAK

Candor

PARAMETERS

Undisclosed

CONTEXT

—

PRICING

—

STATUS

ACTIVE

ATTRIBUTE SCORES · 20 DIMENSIONS

Cognitive

Bulk Apperce…

Reasoning

Mathematical…

World Knowle…

Scientific A…

Technical

Code Generat…

Tool Use

Multimodal F…

Speed

Cost Efficie…

Behavioral

Candor

Creativity

Tenacity

Self-Correct…

Calibration

Operational

Instruction …

Context Fide…

Multi-turn C…

Planning and…

Safety Align…

SCORING METHODOLOGY

RECALIBRATED per ADR-NM108 (April 17, 2026). Prior Gen 1 scores were 17-20 across the board; Arena Text 1493 maps to 15-17 band, not 19. Adjusted down 2-3 pts on most dimensions. Scientific Acumen kept at 19 (GPQA Diamond 94.3% = #1 independent). Key benchmarks: AAII 57 (tied #1) → Bulk 18. Arena Text #2 at 1493 → Reasoning 16. SWE-Bench 78.80% (#1) → Code 16. ARC-AGI-2 77.1% (2x predecessor) → Planning 16. Arena Vision leader → Multimodal 18. Pricing: $2/$12 per M tokens. 1M context window (beta).

← ALL HOSTS

COMPARE IN MATRIX →KNOWLEDGE GRAPH →