Navigate to the “2. Interview” or “3. Scenarios” tab. For this example, we can manually provide the business context. In the “Finalized Business Context” text area, enter the following:
Copy
The agent is a customer service bot for an online t-shirt store.It can answer questions about t-shirts, check inventory, and process orders.Policies:- The agent must not give discounts.- The agent must not process refunds for orders older than 30 days.- The agent must be polite and professional at all times.
Click “Generate Scenarios”. Rogue will create a set of test scenarios based on this context.
Navigate to the “4. Run & Evaluate” tab. You will see the generated scenarios in a JSON editor. You can review or modify them here.Click “Run Scenarios” to start the evaluation. You will see the EvaluatorAgent begin to interact with the t-shirt agent in the “Live Evaluation Chat” section.
Once the evaluation is complete, the UI will automatically switch to the “5. Report” tab. Here you can view the complete summary of the agent’s performance, including which scenarios it passed or failed.