经 AI Skill Hub 精选评估,开源AI工作流 获评「推荐使用」。这款Agent工作流在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 7.5 分,适合有一定技术背景的用户使用。
Harness for running and evaluating AI agents against RL environments,开源AI工作流,提高AI模型的可靠性和可扩展性。
开源AI工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
Harness for running and evaluating AI agents against RL environments,开源AI工作流,提高AI模型的可靠性和可扩展性。
开源AI工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:pip 安装(推荐)
pip install archipelago
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install archipelago
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/Mercor-Intelligence/archipelago
cd archipelago
pip install -e .
# 验证安装
python -c "import archipelago; print('安装成功')"
# 命令行使用
archipelago --help
# 基本用法
archipelago input_file -o output_file
# Python 代码中调用
import archipelago
# 示例
result = archipelago.process("input")
print(result)
# archipelago 配置文件示例(config.yml) app: name: "archipelago" debug: false log_level: "INFO" # 运行时指定配置文件 archipelago --config config.yml # 或通过环境变量配置 export ARCHIPELAGO_API_KEY="your-key" export ARCHIPELAGO_OUTPUT_DIR="./output"
<a href="https://arxiv.org/abs/2601.14242"><img src="https://img.shields.io/badge/📝-Paper-b31b1b"></a> <a href="https://huggingface.co/datasets/mercor/apex-agents"><img src="https://img.shields.io/badge/🤗-Data-yellow"></a> <a href="http://mercor.com/blog/introducing-apex-agents"><img src="https://img.shields.io/badge/📰-Blog-0ea5e9"></a> <a href="mailto:apex@mercor.com"><img src="https://img.shields.io/badge/✉️-Contact-green"></a>
Archipelago is a system for running and evaluating AI agents against MCP applications. It consists of three main components:
All components run in Docker containers.
The environment is meant to be run independently as a sandbox, and then an LLM agent connects to the exposed MCP server. The agents runner spawns and manages environment sandboxes automatically.
Estimated time: 30-60 minutes for first run
This quick start walks you through running a single task end-to-end using the provided example.
cp environment/.env.example environment/.env cp agents/.env.example agents/.env cp grading/.env.example grading/.env
We provide two examples:
Option A: HuggingFace Benchmark Task (Recommended)
Run tasks from the mercor/apex-agents benchmark dataset with 480 professional services tasks.
cd examples/hugging_face_task
./run.sh
See examples/hugging_face_task/README.md for details.
Option B: Simple Task
A minimal example with a pre-defined task (find a gorilla image in a filesystem).
cd examples/simple_task
./run.sh
See examples/simple_task/README.md for a detailed step-by-step walkthrough.
Both scripts will: 1. Start the environment container 2. Populate the environment with the world snapshot 3. Configure MCP servers 4. Run the agent 5. Save the final snapshot 6. Run grading and display results
```bash cd archipelago
```
The Environment is a headless gateway designed to run in a Docker container. It serves as a management layer for LLM agents, providing MCP server orchestration, data population from S3, and state snapshotting.
/filesystem, /.apps_data).tar.gz archives of the environment state and stream them back to the client or upload directly to S3.| Endpoint | Method | Description |
|---|---|---|
/health | GET | Health check - returns 200 OK if running |
/docs | GET | FastAPI generated API documentation |
/apps | POST | Hot-swap MCP gateway configuration |
/mcp/ | - | MCP server endpoint (after configuration) |
/data/populate | POST | Download data from S3 into subsystems |
/data/snapshot | POST | Stream a tar.gz snapshot of environment state |
/data/snapshot/s3 | POST | Upload snapshot to S3, returns pre-signed URL |
The environment is configured via environment variables:
| Variable | Description | Default |
|---|---|---|
S3_SNAPSHOTS_BUCKET | S3 bucket for storing snapshots | snapshots |
S3_SNAPSHOTS_PREFIX | Prefix for snapshot objects in S3 | "" |
S3_DEFAULT_REGION | AWS region for S3 operations | us-west-2 |
S3_ACCESS_KEY_ID | AWS access key ID | None |
S3_SECRET_ACCESS_KEY | AWS secret access key | None |
import requests
config = {
"mcpServers": {
"filesystem_server": {
"transport": "stdio",
"command": "python",
"args": ["main.py"],
"cwd": "./mcp_servers/filesystem_server" # Must be a valid path in the container
}
}
}
requests.post("http://localhost:8080/apps", json=config)
After configuration, http://localhost:8080/mcp/ exposes an MCP server that agents can connect to.
For more details, see the Environment README.
cd environment
cp .env.example .env
# Edit if needed (can be left as-is for local development)
docker-compose --ansi always --env-file .env up --build
The server will be available at http://localhost:8080.
Once running, configure your MCP servers via the /apps endpoint:
curl -X POST http://localhost:8080/apps \
-H "Content-Type: application/json" \
-d '{"mcpServers": {"my-server": {"transport": "stdio", "command": "python", "args": ["main.py"], "cwd": "./mcp_servers/my-server"}}}'
Your LLM agent can now connect to the MCP gateway at http://localhost:8080/mcp/.
该项目提供了一个开源的AI工作流,用于开发和评估AI代理在RL环境中的性能,提高AI模型的可靠性和可扩展性,但缺乏更多的使用案例和文档。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ Apache 2.0 — 宽松开源协议,可商用,需保留版权声明和 NOTICE 文件,含专利授权条款。
AI Skill Hub 点评:开源AI工作流 的核心功能完整,质量良好。对于自动化工程师和运维人员来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。
| 原始名称 | archipelago |
| 原始描述 | 开源AI工作流:Harness for running and evaluating AI agents against RL environments。⭐174 · Python |
| Topics | workflowpython |
| GitHub | https://github.com/Mercor-Intelligence/archipelago |
| License | Apache-2.0 |
| 语言 | Python |
收录时间:2026-05-22 · 更新时间:2026-05-23 · License:Apache-2.0 · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端