agent-evaluation

Category: Tools & Productivity | Uploader: mlflowmlflow | Downloads: 0 | Version: v1.0(Latest)

Use this when you need to EVALUATE OR IMPROVE or OPTIMIZE an existing LLM agent's output quality - including improving tool selection accuracy, answer quality, reducing costs, or fixing issues where the agent gives wrong/incomplete responses. Evaluates agents systematically using MLflow evaluation with datasets, scorers, and tracing. IMPORTANT - Always also load the instrumenting-with-mlflow-tracing skill before starting any work. Covers end-to-end evaluation workflow or individual components (tracing setup, dataset creation, scorer definition, evaluation execution).

Changelog: Source: GitHub https://github.com/mlflow/skills

Directory Structure

Current level: tree/main/agent-evaluation/

SKILL.md

Login to download/like/favorite ❤ 20 | ★ 0
Comments 0

Please login before commenting.

No comments yet. Be the first one!