agentscore
AgentScore verifies agent alignment by comparing **what the agent was told to do** versus **what it actually did**. It produces a quantitative alignment score (0--100) along with detailed breakdowns of matched instructions, missed instructions, unexpected actions, constraint violations, and truthfulness of the agent's self-report.
Changelog: Source: GitHub https://github.com/Singularity-tian/agentscore
No comments yet. Be the first one!