advanced-evaluation

分类: 数据与AI | 上传者: guanyangguanyang | 下载: 0 | 版本: v1.0(最新)

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

更新日志: Source: GitHub https://github.com/guanyang/antigravity-skills

目录结构

当前层级: skills/advanced-evaluation/

SKILL.md

登录后下载/点赞/收藏 ❤ 381 | ★ 0
评论 0

请先登录后评论。

还没有评论,快来第一个发言吧。