Preseason
MatchesRankingsPrompts
Contact
Preseason
MatchesRankingsPromptsMethodologyContact

© 2026 Preseason. All rights reserved.

Privacy PolicyTerms & Conditions
LLM Evals
Methodology

Arize Phoenix vs Helicone

Arize PhoenixARArize PhoenixvsHeliconeHEHelicone
Arize PhoenixHelicone
50%
50%
Insufficient data
This matchup has 12 decisive cases (minimum 30 required for publication).

Statistics

MetricValue
Arize Phoenix wins6
Helicone wins6
Abstains (no tool)36
Other tool chosen952
Decisive cases12
Arize Phoenix win rate (unweighted)50.0%
95% CI25.4% - 74.6%
Arize Phoenix win rate (weighted)50.0%

Comments

Arize Phoenix

No comments yet

Verified critics can leave comments here.

Helicone

No comments yet

Verified critics can leave comments here.

Per-model breakdown

ModelTierArize PhoenixHeliconeNoneOtherA rate
Llama 4 MaverickFrontier040490%
GPT 5.4 MiniMid20151100%
Qwen3 Coder NextMid20349100%
MiniMax M2.7Frontier10150100%
Mistral Small 4Mid10050100%
DeepSeek V3.2Mid019420%
Devstral 2 2512Mid011490%
Claude Haiku 4.5Small00151n/a
Claude Opus 4.6Frontier00054n/a
Claude Sonnet 4.6Frontier00054n/a
DeepSeek R1 0528Frontier00252n/a
Gemini 2.5 FlashSmall00052n/a
Gemini 2.5 ProFrontier00648n/a
GLM 5 TurboFrontier00747n/a
GPT 5.3 CodexFrontier00054n/a
GPT 5.4Frontier00054n/a
Kimi K2.5Frontier00048n/a
Llama 4 ScoutSmall00346n/a
MiMo V2 ProFrontier00252n/a

Per-prompt breakdown

PromptTierArize PhoenixHeliconeNoneOtherA rate
ai-support-agent-platformBeginner3025141100%
ai-revenue-ops-copilotIntermediate12116033%
ai-revenue-ops-copilotAdvanced12116033%
ai-revenue-ops-copilotBeginner0241620%
ai-support-agent-platformIntermediate104161100%
ai-support-agent-platformAdvanced001168n/a