evalyn-analyze
Use when analyzing evalyn evaluation results, investigating failures, comparing runs, or understanding agent performance
Changelog: Source: GitHub https://github.com/shihongDev/evalyn
Use when analyzing evalyn evaluation results, investigating failures, comparing runs, or understanding agent performance
Changelog: Source: GitHub https://github.com/shihongDev/evalyn
No comments yet. Be the first one!