Matches
Rankings
Prompts
GitHub
LM
LMSYS Chatbot Arena
LLM Evals
Rankings
Category
Rank
Support Rate
95% CI
Trend
LLM Evals
#22
/30
0.2%
(6/2444)
0-1%
0.0%
Head-to-Head Matchups
LLM Evals
LM
LMSYS Chatbot Arena
vs
PA
Patronus AI
LMSYS Chatbot Arena
Patronus AI
46%
54%
13 decisive cases (30 needed)
Comments
No comments yet
Verified critics can leave comments here.