Rogue - The AI Agent Evaluator & Red Team Platform
Rogue is a powerful tool designed to evaluate the performance, compliance, security, and reliability of AI agents. It combines dynamic policy evaluation with comprehensive red teaming capabilities to test your agents against 87+ vulnerability types using 30+ attack techniques.
Rogue Server: Contains the core evaluation logic including the Red Team Orchestrator
Client Interfaces: Multiple interfaces that connect to the server:
TUI (Terminal UI): Modern terminal interface built with Go and Bubble Tea
Web UI: Gradio-based web interface
CLI: Command-line interface for automated evaluation and CI/CD
This architecture allows for flexible deployment and usage patterns, where the server can run independently and multiple clients can connect to it simultaneously.
🛡️ 87+ Vulnerability Types: Test against comprehensive vulnerability categories including prompt injection, PII exposure, content safety, bias, and more.
⚔️ 30+ Attack Techniques: Single-turn, multi-turn, and agentic attacks including Base64 encoding, roleplay, social engineering, and advanced jailbreaking.
📋 Compliance Framework Mapping: Automatically map findings to OWASP LLM Top 10, MITRE ATLAS, NIST AI RMF, EU AI Act, GDPR, and more.
📈 CVSS-Based Risk Scoring: Industry-standard risk scoring with severity levels, exploitability metrics, and remediation recommendations.