livebench-coordinator

分类: 调研与分析 | 上传者: shelvickshelvick | 下载: 0 | 版本: v1.0(最新)

Coordinates LiveBench benchmark runs. Reads a pre-built manifest, dispatches one solver per question by sequential index using batch_async, then scores all answers via score-run.sh and produces the benchmark report. Use when running LiveBench evaluations. Do NOT use for MMLU-Pro or general tasks.

更新日志: Source: GitHub https://github.com/shelvick/quoracle

目录结构

当前层级: priv/groves/livebench/skills/livebench-coordinator/

SKILL.md

登录后下载/点赞/收藏 ❤ 24 | ★ 0
评论 0

请先登录后评论。

还没有评论,快来第一个发言吧。