AI Skill Hub 推荐使用:智能体安全评估 是一款优质的Agent工作流。AI 综合评分 7.5 分,在同类工具中表现稳健。如果你正在寻找可靠的Agent工作流解决方案,这是一个值得深入了解的选择。
智能体安全评估 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
智能体安全评估 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:pip 安装(推荐)
pip install agent3sigma-canary
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install agent3sigma-canary
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/antgroup/Agent3Sigma-Canary
cd Agent3Sigma-Canary
pip install -e .
# 验证安装
python -c "import agent3sigma_canary; print('安装成功')"
# 命令行使用
agent3sigma-canary --help
# 基本用法
agent3sigma-canary input_file -o output_file
# Python 代码中调用
import agent3sigma_canary
# 示例
result = agent3sigma_canary.process("input")
print(result)
# agent3sigma-canary 配置文件示例(config.yml) app: name: "agent3sigma-canary" debug: false log_level: "INFO" # 运行时指定配置文件 agent3sigma-canary --config config.yml # 或通过环境变量配置 export AGENT3SIGMA_CANARY_API_KEY="your-key" export AGENT3SIGMA_CANARY_OUTPUT_DIR="./output"
English · Simplified Chinese
Agent3σ-Canary, abbreviated as AgentCanary, is part of the Agent3σ project. It provides security evaluation capabilities for AI Agents in realistic runtime environments. AgentCanary does not simply check whether a model gives a safe textual answer; it drives agents in controlled sandboxes to invoke real tools, process task materials in realistic formats, and evaluate the agent's complete execution trajectory across risk outcome, security awareness, and normal-task utility.

Evaluation tasks run in isolated Docker containers, so you need to build the evaluation environments first.
Before building images, package the Skills repository:
cd _skills_repository
bash buildAll.sh
cd ..
Then build evaluation images. AgentCanary currently supports evaluating vanilla OpenClaw and OpenClaw variants integrated with different security plugins. Each Docker image corresponds to an independent evaluation environment, and you can choose which variants to build.
After running the build script, select the image variants you need from the prompt. For example, choose official if you only want to evaluate vanilla OpenClaw; choose additional plugin images if you want to compare defense effectiveness.
bash workflow/workflow_step_1_image_builder.sh
Tip If network restrictions require an HTTP proxy for Docker builds, see docs/docker_proxy_en.md.
cd leaderboard python build.py
./scripts/run.sh --model openai-compatible/gpt-4o --suite direct --docker --verbose
**Common arguments:**
| Argument | Description | Example |
| --- | --- | --- |
| `--model` | Target model, in `provider-id/model-id` format | `--model anthropic/claude-sonnet-4` |
| `--suite` | Test suite | `direct`, `indirect`, `memory`, `chain`, `skills_poison`, `all`, or comma-separated task IDs |
| `--docker` | Run the agent inside Docker | `--docker` |
| `--verbose` | Print more detailed logs | `--verbose` |
| `--attack` | Attack method | `code_attack`, `pair`, `important_message`, `InjecAgent`, etc. |
| `--runs` | Number of repeated runs per task | `--runs 3` |
| `--output-dir` | Result output directory | `--output-dir results/my_test` |
**Batch evaluation:**
To evaluate multiple models across multiple Docker images in parallel, use scripts under `batch_run/`:
bash
./batch_run/direct.sh example
System requirements:
Install uv:
curl -LsSf https://astral.sh/uv/install.sh | sh
Install Python dependencies:
uv sync
Running AgentCanary requires configuring two types of LLMs:
--model argument.Configuration steps:
```bash
cp config.example.yaml config.yaml
vim config.yaml
uv run python scripts/validate_api.py
In `config.yaml`, configure:
- **providers**: API endpoints, keys, and available model lists. You can define multiple providers, and each provider can contain multiple models. The `provider-id/model-id` pair is the value used by the `--model` argument.
- **roles**: Auxiliary model assignments for roles such as pair, judge, and ipi. Auxiliary model requests currently use OpenAI-compatible Chat Completions APIs.
Example:
yaml providers: openai-compatible: base_url: "https://api.openai.com/v1" api_key: "sk-xxx" api: "openai-completions" models: - id: "gpt-4o" name: "GPT-4o"
anthropic: base_url: "https://api.anthropic.com/v1" api_key: "sk-ant-xxx" api: "anthropic-messages" models: - id: "claude-sonnet-4" name: "Claude Sonnet 4"
roles: # pair: # base_url: "https://api.openai.com/v1" # api_key: "sk-xxx" # model: "gpt-4o" judge: base_url: "https://api.openai.com/v1" api_key: "sk-xxx" model: "gpt-4o" # ipi: # base_url: "https://api.openai.com/v1" # api_key: "sk-xxx" # model: "gpt-4o"
**Generate runtime configuration:**
bash
bash setup.sh
source env.sh ```
cp batch_run/batch_config.example.sh batch_run/batch_config.sh vim batch_run/batch_config.sh
providers:
<provider-id>: # Unique provider ID used as the --model prefix
base_url: "" # [required] API endpoint URL
api_key: "" # [required] API key
api: "openai-completions" # [optional] API protocol: "openai-completions" or "anthropic-messages"
models:
- id: "" # [required] Model ID used as the --model suffix
name: "" # [optional] Display name, defaults to id
api: "" # [optional] Per-model API protocol override
reasoning: false # [optional] Whether reasoning is supported, defaults to false
context_window: 384000 # [optional] Context window size
max_tokens: 384000 # [optional] Maximum output tokens
roles:
pair: # PAIR attacker
base_url: "" # [required] OpenAI-compatible API endpoint
api_key: "" # [required] API key
model: "" # [required] Model name
judge: # Judge scorer
base_url: ""
api_key: ""
model: ""
settings:
web_sim_base_url: "" # [optional] Web simulation service URL

提供实用AI安全评估工作流
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ Apache 2.0 — 宽松开源协议,可商用,需保留版权声明和 NOTICE 文件,含专利授权条款。
总体来看,智能体安全评估 是一款质量良好的Agent工作流,在同类工具中具备一定竞争力。AI Skill Hub 将持续追踪其更新动态,建议收藏备用,结合自身场景选择合适时机引入使用。
| 原始名称 | Agent3Sigma-Canary |
| 原始描述 | 开源AI工作流:Agent3σ-Canary is an evaluation framework for AI Agent security in realistic run。⭐12 · Python |
| Topics | AI安全工作流 |
| GitHub | https://github.com/antgroup/Agent3Sigma-Canary |
| License | Apache-2.0 |
| 语言 | Python |
收录时间:2026-05-26 · 更新时间:2026-05-30 · License:Apache-2.0 · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端