HOST DIAGNOSTIC · Other

Kimi K2.5

Kimi K2.5 · ACTIVE· Other· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
Bulk Apperception
[14]
Reasoning
[14]
Math Precision
[14]
Code Generation
[16]
World Knowledge
[14]
Scientific Acumen
[14]
Instruction Following
[14]
Context Fidelity
[14]
Candor
[13]
Creativity
[13]
Multi-turn Coherence
[14]
Multimodal Fluency
[12]
Tool Use
[14]
Planning
[16]
Self-Correction
[13]
Tenacity
[14]
Safety Alignment
[12]
Calibration
[13]
Speed
[14]
Cost Efficiency
[17]
HOST TELEMETRY
AVG
13.9
/20
BEST
17
Cost Efficiency
WEAK
12
Multimodal Fluency
PARAMETERS
Undisclosed
CONTEXT
PRICING
STATUS
ACTIVE
ATTRIBUTE SCORES · 20 DIMENSIONS
Cognitive
Bulk Apperce…
14
Reasoning
14
Mathematical…
14
World Knowle…
14
Scientific A…
14
Technical
Code Generat…
16
Tool Use
14
Multimodal F…
12
Speed
14
Cost Efficie…
17
Behavioral
Candor
13
Creativity
13
Tenacity
14
Self-Correct…
13
Calibration
13
Operational
Instruction …
14
Context Fide…
14
Multi-turn C…
14
Planning and…
16
Safety Align…
12
SCORING METHODOLOGY
CALIBRATED per ADR-NM108 (April 2026 snapshot). Moonshot AI (China). SWE-bench Verified: 76.8% → Code Gen 16 (75-89% band). Agent Swarm technology: coordinates up to 100 sub-agents in parallel, reducing execution time for complex coding by up to 4.5x. On Arena Document leaderboard (added April 14). Planning+Execution scored high (16) for multi-agent coordination. Kimi K2.5 Thinking variant available. Cost Efficiency 17 (open weights, competitive pricing).