inspect-ai
Analyze Inspect AI evaluation logs, understand EvalLog structure, extract samples, events, and scoring data using dataframes
Changelog: Source: GitHub https://github.com/UKGovernmentBEIS/sandbox_escape_bench
Analyze Inspect AI evaluation logs, understand EvalLog structure, extract samples, events, and scoring data using dataframes
Changelog: Source: GitHub https://github.com/UKGovernmentBEIS/sandbox_escape_bench
No comments yet. Be the first one!