Preseason
MatchesRankingsPrompts
GitHub
Preseason
MatchesRankingsPromptsMethodologyContact

© 2026 Preseason. All rights reserved.

Privacy PolicyTerms & Conditions
LLM Evals
Methodology

TruLens vs Helicone

TRTruLensvsHeliconeHEHelicone
TruLensHelicone
50%
50%
Insufficient data
This matchup has 14 decisive cases (minimum 30 required for publication).

Statistics

MetricValue
TruLens wins7
Helicone wins7
Abstains (no tool)90
Other tool chosen2340
Decisive cases14
TruLens win rate (unweighted)50.0%
95% CI26.8% - 73.2%
TruLens win rate (weighted)50.0%

Comments

TruLens

No comments yet

Verified critics can leave comments here.

Helicone

No comments yet

Verified critics can leave comments here.

Per-model breakdown

ModelTierTruLensHeliconeNoneOtherA rate
Llama 4 MaverickFrontier0501220%
DeepSeek R1 0528Frontier307122100%
Mistral Small 4Mid301120100%
Gemini 2.5 ProFrontier109122100%
DeepSeek V3.2Mid01221050%
Devstral 2 2512Mid0141200%
Claude Haiku 4.5Small001124n/a
Claude Opus 4.6Frontier000132n/a
Claude Sonnet 4.6Frontier000132n/a
Gemini 2.5 FlashSmall001126n/a
GLM 5 TurboFrontier0019113n/a
GPT 5.3 CodexFrontier000132n/a
GPT 5.4Frontier000132n/a
GPT 5.4 MiniMid003129n/a
Kimi K2.5Frontier003116n/a
Llama 4 ScoutSmall004117n/a
MiMo V2 ProFrontier008124n/a
MiniMax M2.7Frontier005124n/a
Qwen3 Coder NextMid003128n/a

Per-prompt breakdown

PromptTierTruLensHeliconeNoneOtherA rate
ai-revenue-ops-copilotIntermediate13439625%
ai-support-agent-platformIntermediate305401100%
ai-revenue-ops-copilotBeginner121039733%
ai-revenue-ops-copilotAdvanced12239533%
ai-support-agent-platformAdvanced105404100%
ai-support-agent-platformBeginner0064347n/a