benchflow
Run agent benchmarks, create tasks, analyze results, and manage agents using BenchFlow. Use when asked to benchmark an AI coding agent, run a benchmark suite, create tasks, view trajectories, or compare agent performance.
Changelog: Source: GitHub https://github.com/benchflow-ai/benchflow
No comments yet. Be the first one!