play_circle
Active Runs
Loading runs...
inbox
No benchmark runs found
Start a new benchmark from the
Run Benchmark page
Run Benchmark page
Loading run details...
rocket_launch
Run Benchmark
Configure and launch agent evaluation runs
Loading form...
Select tags...
expand_more
Select an agent to load tests
cloud
Infrastructure:
Manage in Infrastructure arrow_forward
assessment
View Report
Explore detailed benchmark results and metrics
Loading report...
compare_arrows
Compare Reports
Side-by-side comparison of benchmark runs
Comparing...
cloud
Infrastructure
Deploy and manage test infrastructure per agent
Loading infrastructure status...
account_tree
Agent Analyzer
Inspect conversation execution flows, tool calls & token usage
bolt Quick load
manage_search
No conversations loaded
Select a tenant and account above,
then click Search to explore
then click Search to explore
timeline
Select a conversation
Pick a conversation from the sidebar to
inspect the full execution flow
inspect the full execution flow