✅ happy_path PASS
GPT-4o-mini • 22 tokens • Standard chat completion
✅ tool_call_chain PASS
GPT-4o • 107 tokens • Multi-turn tool usage
✅ sensitive_payload PASS
GPT-4o-mini • 54 tokens • PII content detected
✅ huge_payload PASS
GPT-4o-mini • 5003 tokens • Large payload handling
✅ mixed_providers Anthropic
claude-3-5-sonnet-20241022 • 14 tokens
✅ deepseek_chat DeepSeek
deepseek-chat • 28 tokens
✅ grok_xai xAI
grok-2 • 18 tokens
✅ qwen_alibaba Alibaba
qwen-turbo • 16 tokens
✅ runaway_loop PASS
GPT-4o • 51 tokens • Loop detection
✅ malformed_request PASS
Error handling • Graceful failure