skill-eval

分类: 运维与交付 | 上传者: aws-samplesaws-samples | 下载: 0 | 版本: v1.0(最新)

Evaluate AI Agent Skills across safety, quality, reliability, and cost efficiency. Audit for security issues (secrets, injection, unsafe installs), test functional correctness with-skill vs without-skill, measure trigger precision, classify cost-efficiency tradeoffs, track version lifecycle, and generate unified grades. Use when evaluating a skill before installing, auditing marketplace skills, proving your skill works with automated tests, setting up CI/CD quality gates, or comparing two skill versions. NOT for: evaluating full agent systems, testing non-skill plugins, runtime performance benchmarking, or monitoring production agent behavior.

更新日志: Source: GitHub https://github.com/aws-samples/sample-agent-skill-eval

目录结构

当前层级: 根目录

SKILL.md

登录后下载/点赞/收藏 ❤ 2 | ★ 0
评论 0

请先登录后评论。

还没有评论,快来第一个发言吧。