instruction-following-completeness
Score completeness of instruction execution. Partial edits get partial credit; missed sub-steps cost points.
Changelog: Source: GitHub https://github.com/TIGER-AI-Lab/RewardHarness
Score completeness of instruction execution. Partial edits get partial credit; missed sub-steps cost points.
Changelog: Source: GitHub https://github.com/TIGER-AI-Lab/RewardHarness
Loading comments...