skill-eval

Category: Ops & Delivery | Uploader: aws-samplesaws-samples | Downloads: 0 | Version: v1.0(Latest)

Evaluate AI Agent Skills across safety, quality, reliability, and cost efficiency. Audit for security issues (secrets, injection, unsafe installs), test functional correctness with-skill vs without-skill, measure trigger precision, classify cost-efficiency tradeoffs, track version lifecycle, and generate unified grades. Use when evaluating a skill before installing, auditing marketplace skills, proving your skill works with automated tests, setting up CI/CD quality gates, or comparing two skill versions. NOT for: evaluating full agent systems, testing non-skill plugins, runtime performance benchmarking, or monitoring production agent behavior.

Changelog: Source: GitHub https://github.com/aws-samples/sample-agent-skill-eval

Directory Structure

Current level: Root

SKILL.md

Login to download/like/favorite ❤ 2 | ★ 0
Comments 0

Please login before commenting.

No comments yet. Be the first one!