- upload or select a dataset
- add prompt and agent candidates
- run every dataset row against every candidate
- review outputs, usage, cost, and failures in one place
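
The steps above amount to a full cross product of dataset rows and candidates, with failures kept alongside successes so they can be reviewed together. A minimal sketch of that idea follows. It is an illustration only: `RunResult`, `run_experiment`, and the two sample candidates are hypothetical names, not part of the product's API, and a real run would also record usage and cost.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Dict, List, Optional


@dataclass
class RunResult:
    # Hypothetical record of one (row, candidate) execution.
    row_index: int
    candidate: str
    output: Optional[str]
    error: Optional[str]


def run_experiment(
    rows: List[dict], candidates: Dict[str, Callable[[dict], str]]
) -> List[RunResult]:
    """Run every dataset row against every candidate (rows x candidates)."""
    results = []
    for (i, row), (name, fn) in product(enumerate(rows), candidates.items()):
        try:
            results.append(RunResult(i, name, fn(row), None))
        except Exception as exc:
            # Keep the failure instead of aborting, so it can be reviewed later.
            results.append(RunResult(i, name, None, str(exc)))
    return results


# Hypothetical dataset rows and candidates, for illustration only.
rows = [{"question": "2+2"}, {"question": ""}]


def echo_candidate(row: dict) -> str:
    return f"echo: {row['question']}"


def strict_candidate(row: dict) -> str:
    if not row["question"]:
        raise ValueError("empty question")
    return row["question"].upper()


candidates = {"echo": echo_candidate, "strict": strict_candidate}
results = run_experiment(rows, candidates)
failures = [r for r in results if r.error is not None]
print(len(results), "runs,", len(failures), "failed")
```

With two rows and two candidates this produces four results, one of which is a recorded failure rather than a crash.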

What you’ll find here

- Datasets — Upload CSV files, map columns, and understand dataset format
- Add candidates — Add dashboard prompts, deployed prompts, and agents
- Build an experiment — Create an experiment and prepare it for a run
- Run an experiment — Start, track, and cancel experiment runs
- Review results — Compare outputs, open request details, and inspect failures
- Run analytics — Compare run cost, tokens, latency, and success rate
- Evaluators — Understand current evaluator status and planned evaluator types

