Open-source LLM evaluation framework for unit, regression, and agent tests