HOST DIAGNOSTIC · Meta

Muse Spark

Muse Spark · ACTIVE· Llama· CLOSED
ACTIVE
ATTRIBUTE MATRIX · ATRBT GROUP 01
Bulk Apperception
[15]
Reasoning
[15]
Math Precision
[14]
Code Generation
[15]
World Knowledge
[15]
Scientific Acumen
[14]
Instruction Following
[15]
Context Fidelity
[14]
Candor
[14]
Creativity
[15]
Multi-turn Coherence
[14]
Multimodal Fluency
[17]
Tool Use
[13]
Planning
[14]
Self-Correction
[14]
Tenacity
[14]
Safety Alignment
[15]
Calibration
[13]
Speed
[15]
Cost Efficiency
[13]
HOST TELEMETRY
AVG
14.4
/20
BEST
17
Multimodal Fluency
WEAK
13
Tool Use
PARAMETERS
Undisclosed
CONTEXT
PRICING
STATUS
ACTIVE
ATTRIBUTE SCORES · 20 DIMENSIONS
Cognitive
Bulk Apperce…
15
Reasoning
15
Mathematical…
14
World Knowle…
15
Scientific A…
14
Technical
Code Generat…
15
Tool Use
13
Multimodal F…
17
Speed
15
Cost Efficie…
13
Behavioral
Candor
14
Creativity
15
Tenacity
14
Self-Correct…
14
Calibration
13
Operational
Instruction …
15
Context Fide…
14
Multi-turn C…
14
Planning and…
14
Safety Align…
15
SCORING METHODOLOGY
CALIBRATED per ADR-NM108 (April 2026 snapshot). Meta's 2026 frontier chat model. Arena Text: #3 at 1495 ELO → Reasoning 15 (1450-1499 band). Arena Vision: #3 at 1292 ELO → Multimodal 17 (vision frontier). Tool Use scored conservatively at 13 — Meta models historically less mature on function calling vs Anthropic/Google. Speed: 15 (Meta inference infrastructure is fast). Cost Efficiency: 13 (frontier pricing, exact $/M not yet confirmed at snapshot time).