livebench-coordinator

Category: Research & Analysis | Uploader: shelvick | Downloads: 0 | Version: v1.0 (Latest)

Coordinates LiveBench benchmark runs. The skill reads a pre-built manifest, uses batch_async to dispatch one solver per question in sequential index order, then scores all answers with score-run.sh and produces the benchmark report. Use it when running LiveBench evaluations. Do NOT use it for MMLU-Pro or general tasks.
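The dispatch step described above can be sketched roughly as follows. This is an illustration only, not the skill's actual implementation: the manifest layout (a JSON object with a "questions" list), the solve() coroutine, and the answer fields are all hypothetical, and asyncio.gather merely stands in for batch_async, whose real interface is not documented here. Scoring is left out because the arguments score-run.sh expects are not given on this page.

```python
import asyncio
import json


async def solve(index, question):
    """Hypothetical stand-in for a single solver; returns a placeholder answer."""
    await asyncio.sleep(0)  # yield control, as a real solver call would while waiting
    return {
        "index": index,
        "question_id": question["id"],
        "answer": f"placeholder-{index}",
    }


async def dispatch_all(manifest):
    """Start one solver per question, keyed by its sequential index in the manifest."""
    tasks = [solve(i, q) for i, q in enumerate(manifest["questions"])]
    # gather returns results in the same order as the tasks were passed in
    return await asyncio.gather(*tasks)


def run(manifest_text):
    """Parse the (assumed JSON) manifest and collect one answer per question."""
    manifest = json.loads(manifest_text)
    answers = asyncio.run(dispatch_all(manifest))
    # Sort explicitly so later scoring sees answers in index order
    return sorted(answers, key=lambda a: a["index"])


if __name__ == "__main__":
    demo = json.dumps({"questions": [{"id": "q-a"}, {"id": "q-b"}]})
    for row in run(demo):
        print(row["index"], row["question_id"], row["answer"])
```

Keying each answer by its manifest index keeps the answers aligned with the questions even if the solvers finish out of order.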

Changelog: Source: GitHub https://github.com/shelvick/quoracle

Directory Structure

Current level: priv/groves/livebench/skills/livebench-coordinator/

SKILL.md
