Preseason

© 2026 Preseason. All rights reserved.

LLM Observability

Arize Phoenix vs Datadog

Arize Phoenix: 40%
Datadog: 60%

Leading: Datadog (60.2%)

Statistics

| Metric | Value |
|---|---|
| Arize Phoenix wins | 33 |
| Datadog wins | 50 |
| Abstains (no tool) | 45 |
| Other tool chosen | 2345 |
| Decisive cases | 83 |
| Arize Phoenix win rate (unweighted) | 39.8% |
| 95% CI | 29.9% - 50.5% |
| Arize Phoenix win rate (weighted) | 39.8% |
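
The unweighted win rate and its interval can be reproduced from the two win counts. The page does not say how the 95% CI is computed; the sketch below assumes a Wilson score interval with z = 1.96, which happens to match the reported bounds. The function name `wilson_interval` is illustrative, and the weighted rate is not reproduced because the weighting scheme is not given here.

```python
import math


def wilson_interval(wins: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (assumed method)."""
    if n == 0:
        raise ValueError("no decisive cases")
    p = wins / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half, center + half


# Counts taken from the Statistics table.
phoenix_wins, datadog_wins = 33, 50

# Abstains and "other tool" answers are excluded from the denominator.
decisive = phoenix_wins + datadog_wins
rate = phoenix_wins / decisive
low, high = wilson_interval(phoenix_wins, decisive)

print(f"{rate:.1%} ({low:.1%} - {high:.1%})")  # 39.8% (29.9% - 50.5%)
```

The interval spans 50%, so with only 83 decisive cases the Datadog lead is suggestive rather than conclusive.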

Per-model breakdown

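| Model | Tier | Arize Phoenix | Datadog | None | Other | A rate |
|---|---|---|---|---|---|---|
| Gemini 2.5 Flash | Small | 0 | 44 | 1 | 87 | 0% |
| MiniMax M2.7 | Frontier | 15 | 0 | 3 | 112 | 100% |
| Qwen3 Coder Next | Mid | 11 | 0 | 1 | 119 | 100% |
| Mistral Small 4 | Mid | 3 | 1 | 0 | 125 | 75% |
| DeepSeek V3.2 | Mid | 2 | 2 | 0 | 128 | 50% |
| Claude Haiku 4.5 | Small | 0 | 2 | 0 | 127 | 0% |
| Devstral 2 2512 | Mid | 1 | 0 | 15 | 110 | 100% |
| GPT 5.4 Mini | Mid | 1 | 0 | 1 | 130 | 100% |
| MiMo V2 Pro | Frontier | 0 | 1 | 2 | 129 | 0% |
| Claude Opus 4.6 | Frontier | 0 | 0 | 0 | 132 | n/a |
| Claude Sonnet 4.6 | Frontier | 0 | 0 | 0 | 132 | n/a |
| DeepSeek R1 0528 | Frontier | 0 | 0 | 1 | 131 | n/a |
| Gemini 2.5 Pro | Frontier | 0 | 0 | 6 | 126 | n/a |
| GLM 5 Turbo | Frontier | 0 | 0 | 0 | 132 | n/a |
| GPT 5.3 Codex | Frontier | 0 | 0 | 0 | 132 | n/a |
| GPT 5.4 | Frontier | 0 | 0 | 0 | 132 | n/a |
| Kimi K2.5 | Frontier | 0 | 0 | 4 | 115 | n/a |
| Llama 4 Maverick | Frontier | 0 | 0 | 0 | 132 | n/a |
| Llama 4 Scout | Small | 0 | 0 | 11 | 114 | n/a |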

Per-prompt breakdown

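| Prompt | Tier | Arize Phoenix | Datadog | None | Other | A rate |
|---|---|---|---|---|---|---|
| ai-support-agent-platform | Advanced | 8 | 18 | 1 | 388 | 31% |
| ai-revenue-ops-copilot | Advanced | 6 | 16 | 2 | 385 | 27% |
| ai-revenue-ops-copilot | Beginner | 5 | 13 | 29 | 370 | 28% |
| ai-support-agent-platform | Intermediate | 6 | 1 | 1 | 408 | 86% |
| ai-revenue-ops-copilot | Intermediate | 5 | 0 | 1 | 397 | 100% |
| ai-support-agent-platform | Beginner | 3 | 2 | 11 | 397 | 60% |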
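
As a consistency check, the per-prompt counts can be summed and compared with the Statistics totals. The sketch below assumes that "A rate" means Arize Phoenix wins divided by decisive cases (Phoenix plus Datadog wins), with n/a when a row has no decisive cases. That definition is inferred from the numbers, not stated on the page, and the names `ROWS` and `a_rate` are illustrative.

```python
from typing import Optional

# (prompt, tier, phoenix, datadog, none, other), read from the per-prompt rows.
ROWS = [
    ("ai-support-agent-platform", "Advanced", 8, 18, 1, 388),
    ("ai-revenue-ops-copilot", "Advanced", 6, 16, 2, 385),
    ("ai-revenue-ops-copilot", "Beginner", 5, 13, 29, 370),
    ("ai-support-agent-platform", "Intermediate", 6, 1, 1, 408),
    ("ai-revenue-ops-copilot", "Intermediate", 5, 0, 1, 397),
    ("ai-support-agent-platform", "Beginner", 3, 2, 11, 397),
]


def a_rate(phoenix: int, datadog: int) -> Optional[float]:
    """Share of decisive cases won by Arize Phoenix; None if there are none."""
    decisive = phoenix + datadog
    return phoenix / decisive if decisive else None


for prompt, tier, phoenix, datadog, _none, _other in ROWS:
    rate = a_rate(phoenix, datadog)
    label = "n/a" if rate is None else f"{rate:.0%}"
    print(f"{prompt} ({tier}): {label}")

# Column sums should match the Statistics table: 33, 50, 45, 2345.
totals = [sum(col) for col in zip(*(row[2:] for row in ROWS))]
print(totals)
```

The same helper applies to the per-model rows, where models with zero decisive cases show n/a.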