Playground - Scorecard Docs

Screenshot of the Playground showing a visual node-based workflow with connected components.

Screenshot of the Playground showing a visual node-based workflow with connected components.

Overview

The Playground is a visual workflow builder where you connect nodes to test your AI agent:

Testset (left): Your test data flows into the agent
Agent (center): Configure prompts and model settings
Evaluator (top right): Select metrics to score outputs
Results: View agent responses for each testcase
Scores: See pass/fail status and metric scores

Running an Evaluation

Select a testset or add testcases manually
Configure your prompt using Jinja syntax with variables like {{context}} or {{allInputs}}
Add metrics to the Evaluator node
Click Run to execute the evaluation

Results flow through the workflow—watch as responses appear in the Results node and scores populate in the Scores node.

Testsets

Create and manage test data

Metrics

Define evaluation criteria

⌘I