llm-benchmark-analyst

分类: 数据与AI | 上传者: ChekhovinChekhovin | 下载: 0 | 版本: v1.0(最新)

search and analyze llm benchmark results within a fixed benchmark universe, then produce evidence-based model strength and weakness reports or domain-leader summaries. use when comparing a model across benchmarks, ranking the best models by domain, explaining what a benchmark measures, checking predecessor-vs-current progress, or writing benchmark reports that must prioritize exact model version, evaluation date, benchmark variant, score semantics, sub-scores, and benchmark defect warnings. works with browser, web, and multimodal extraction for text, table, canvas, or image-only leaderboards.

更新日志: Source: GitHub https://github.com/Chekhovin/awesome-llm-benchmarks

目录结构

当前层级: llm-benchmark-analyst/

SKILL.md

登录后下载/点赞/收藏 ❤ 3 | ★ 0
评论 0

请先登录后评论。

还没有评论,快来第一个发言吧。