Preseason

© 2026 Preseason. All rights reserved.

Patronus AI vs TruLens

Patronus AI 40% vs TruLens 60%

Leading: TruLens (60.0%)

Insufficient data
This matchup has 5 decisive cases (minimum 30 required for publication).

Statistics

Metric                               Value
Patronus AI wins                     2
TruLens wins                         3
Abstains (no tool)                   36
Other tool chosen                    959
Decisive cases                       5
Patronus AI win rate (unweighted)    40.0%
95% CI                               11.8% - 76.9%
Patronus AI win rate (weighted)      40.0%
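
The unweighted win rate and its 95% CI can be reproduced from the two win counts. This is a minimal sketch, assuming the interval is a Wilson score interval: the page does not name its method, but Wilson with z = 1.96 gives 11.8% - 76.9% for 2 wins in 5 decisive cases. The function name and the MIN_DECISIVE constant are illustrative; the threshold of 30 is the one quoted in the notice above. The weighted win rate is not covered because the weighting scheme is not described here.

```python
import math

MIN_DECISIVE = 30  # publication threshold quoted on this page


def wilson_interval(wins, n, z=1.96):
    """Wilson score interval for a binomial proportion (assumed method)."""
    if n == 0:
        raise ValueError("no decisive cases")
    p = wins / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


patronus_wins, trulens_wins = 2, 3
decisive = patronus_wins + trulens_wins  # abstains and other tools are excluded

low, high = wilson_interval(patronus_wins, decisive)
print(f"win rate {patronus_wins / decisive:.1%}, 95% CI {low:.1%} - {high:.1%}")
print("meets publication threshold:", decisive >= MIN_DECISIVE)
```

With only 5 decisive cases the interval spans most of the range, which is why the matchup is flagged as insufficient data.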

Per-model breakdown

Model               Tier      Patronus AI  TruLens  None  Other  A rate (Patronus AI)
DeepSeek R1 0528    Frontier  0            2        2     50     0%
Gemini 2.5 Pro      Frontier  1            0        6     47     100%
MiMo V2 Pro         Frontier  1            0        2     51     100%
Mistral Small 4     Mid       0            1        0     50     0%
Claude Haiku 4.5    Small     0            0        1     51     n/a
Claude Opus 4.6     Frontier  0            0        0     54     n/a
Claude Sonnet 4.6   Frontier  0            0        0     54     n/a
DeepSeek V3.2       Mid       0            0        9     43     n/a
Devstral 2 2512     Mid       0            0        1     50     n/a
Gemini 2.5 Flash    Small     0            0        0     52     n/a
GLM 5 Turbo         Frontier  0            0        7     47     n/a
GPT 5.3 Codex       Frontier  0            0        0     54     n/a
GPT 5.4             Frontier  0            0        0     54     n/a
GPT 5.4 Mini        Mid       0            0        1     53     n/a
Kimi K2.5           Frontier  0            0        0     48     n/a
Llama 4 Maverick    Frontier  0            0        0     53     n/a
Llama 4 Scout       Small     0            0        3     46     n/a
MiniMax M2.7        Frontier  0            0        1     51     n/a
Qwen3 Coder Next    Mid       0            0        3     51     n/a
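
The A rate column appears to be Patronus AI wins divided by decisive cases (Patronus AI wins plus TruLens wins), shown as n/a when a model produced no decisive case. The sketch below illustrates that reading on four rows from the table; the helper name a_rate is hypothetical, and the formula is inferred from the displayed values rather than stated by the page.

```python
def a_rate(a_wins, b_wins):
    """Share of decisive cases won by tool A, or 'n/a' if there are none."""
    decisive = a_wins + b_wins
    if decisive == 0:
        return "n/a"
    return f"{a_wins / decisive:.0%}"


# (Patronus AI wins, TruLens wins) for a few models from the table
rows = {
    "DeepSeek R1 0528": (0, 2),
    "Gemini 2.5 Pro": (1, 0),
    "Mistral Small 4": (0, 1),
    "Claude Opus 4.6": (0, 0),
}

for model, (a, b) in rows.items():
    print(f"{model}: {a_rate(a, b)}")
```

"None" and "Other" responses do not enter the rate, so a model that never picked either tool gets n/a rather than 0%.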

Per-prompt breakdown

Prompt                      Tier          Patronus AI  TruLens  None  Other  A rate (Patronus AI)
ai-support-agent-platform   Beginner      1            0        25    143    100%
ai-support-agent-platform   Advanced      1            0        1     167    100%
ai-revenue-ops-copilot      Intermediate  0            1        1     162    0%
ai-revenue-ops-copilot      Advanced      0            1        1     162    0%
ai-support-agent-platform   Intermediate  0            1        4     161    0%
ai-revenue-ops-copilot      Beginner      0            0        4     164    n/a
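
The per-prompt rows can be reconciled against the headline statistics. In the sketch below the counts are read so that each column sums to the published totals (2 Patronus AI wins, 3 TruLens wins, 36 abstains, 959 other-tool choices); that reading is an assumption wherever adjacent digits in the table could be split in more than one way.

```python
# (prompt, tier, Patronus AI, TruLens, None, Other)
rows = [
    ("ai-support-agent-platform", "Beginner", 1, 0, 25, 143),
    ("ai-support-agent-platform", "Advanced", 1, 0, 1, 167),
    ("ai-revenue-ops-copilot", "Intermediate", 0, 1, 1, 162),
    ("ai-revenue-ops-copilot", "Advanced", 0, 1, 1, 162),
    ("ai-support-agent-platform", "Intermediate", 0, 1, 4, 161),
    ("ai-revenue-ops-copilot", "Beginner", 0, 0, 4, 164),
]

# Column totals in the order: Patronus AI, TruLens, None, Other
totals = [sum(row[i] for row in rows) for i in range(2, 6)]
expected = [2, 3, 36, 959]  # values from the Statistics section

print("column totals:", totals)
print("matches Statistics:", totals == expected)
print("total responses:", sum(totals))
```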