agentscore
AgentScore verifies agent alignment by comparing **what the agent was told to do** versus **what it actually did**. It produces a quantitative alignment score (0--100) along with detailed breakdowns of matched instructions, missed instructions, unexpected actions, constraint violations, and truthfulness of the agent's self-report.
更新日志: Source: GitHub https://github.com/Singularity-tian/agentscore
还没有评论,快来第一个发言吧。