

Overview
The Playground is a visual workflow builder where you connect nodes to test your AI agent:- Testset (left): Your test data flows into the agent
- Agent (center): Configure prompts and model settings
- Evaluator (top right): Select metrics to score outputs
- Results: View agent responses for each testcase
- Scores: See pass/fail status and metric scores
Running an Evaluation
- Select a testset or add testcases manually
- Configure your prompt using Jinja syntax with variables like
{{context}}or{{allInputs}} - Add metrics to the Evaluator node
- Click Run to execute the evaluation