开源AI工作流 是 AI Skill Hub 本期精选Agent工作流之一。综合评分 7.5 分,整体质量较高。我们推荐使用将其纳入你的 AI 工具库,帮助提升工作效率。
Autonomous multi-agent软件开发系统,转换自然语言请求,实现AI工作流自动化
开源AI工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
Autonomous multi-agent软件开发系统,转换自然语言请求,实现AI工作流自动化
开源AI工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:pip 安装(推荐)
pip install ai-team
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install ai-team
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/RickZee/ai-team
cd ai-team
pip install -e .
# 验证安装
python -c "import ai_team; print('安装成功')"
# 命令行使用
ai-team --help
# 基本用法
ai-team input_file -o output_file
# Python 代码中调用
import ai_team
# 示例
result = ai_team.process("input")
print(result)
# ai-team 配置文件示例(config.yml) app: name: "ai-team" debug: false log_level: "INFO" # 运行时指定配置文件 ai-team --config config.yml # 或通过环境变量配置 export AI_TEAM_API_KEY="your-key" export AI_TEAM_OUTPUT_DIR="./output"
A multi-agent software development system that transforms natural language into production-ready code — and a framework comparison platform for evaluating orchestration approaches side by side.

| Feature | Description |
|---|---|
| **9 specialized agents** | Manager, Product Owner, Architect, Backend/Frontend/Fullstack Developers, DevOps, Cloud, QA |
| **End-to-end workflow** | Intake → Planning → Development → Testing → Deployment |
| **Multi-backend** | Same team, same demos, different orchestration — compare results |
| **Team profiles** | Right-size the team for the use case (--team backend-api, --team prototype, ...) |
| **Enterprise guardrails** | Behavioral (role, scope), security (code safety, PII, secrets), quality (syntax, completeness) |
| **MCP servers** | Per-team, per-agent MCP tool providers (GitHub, filesystem, Docker, Postgres) |
| **RAG knowledge** | Static best practices + dynamic project knowledge, scoped per agent role |
| **Self-improvement reports** | Each run produces a manager report that summarizes failures, references prior lessons, and proposes corrective actions |
| **AutoOptimizer Loop** | Karpathy-style autonomous edit→run→measure→keep/revert loop for iterative code optimization |
| **Observable** | Web dashboard (FastAPI + React), Textual TUI, Rich CLI monitor, structured logging, cost tracking |
poetry run python -m ai_team run "Build a minimal Flask API" \ --backend langgraph --team backend-api --env dev --skip-estimate
```bash git clone https://github.com/RickZee/ai-team.git && cd ai-team cp .env.example .env # Add API keys (see Configuration below) poetry install
Six ready-to-run scenarios that exercise the full pipeline:
| # | Demo | Description |
|---|---|---|
| 1 | 01_hello_world | Minimal Flask REST API — health, items CRUD, pytest, Dockerfile |
| 2 | 02_todo_app | Full-stack TODO app — Flask + SQLite backend, HTML/JS frontend |
| 3 | 03_data_pipeline | ETL pipeline — CSV ingest, validate/transform, SQLite load, CLI report |
| 4 | 04_ml_api | FastAPI ML inference service — scikit-learn model, predict/health/metrics endpoints |
| 5 | 05_microservices | Three-service system — API Gateway, User Service, Notification Service + docker-compose |
| 6 | 06_karpathy_optimization | AutoOptimizer Loop — iterative metric-driven optimization with keep/revert and RAG lessons |
```bash poetry run python scripts/run_demo.py demos/01_hello_world poetry run python scripts/run_demo.py demos/02_todo_app --skip-estimate
ai-team optimize demos/06_karpathy_optimization/workspace \ --metric demos/06_karpathy_optimization/metric.yaml \ --strategy demos/06_karpathy_optimization/strategy.md \ --budget 2.00 ```
Each demo directory contains input.json with the project spec and expected_output.json as an acceptance contract. After a run, scripts/capture_demo.py verifies the output and writes RESULTS.md.
For the full file layout, schema reference, capture/verification workflow, and instructions for adding new demos, see docs/DEMOS.md.
ai-team optimize ./workspace/my-app \ --metric metric.yaml \ --strategy strategy.md \ --backend claude-agent-sdk \ --team research-optimizer \ --budget 5.00 \ --max-experiments 50 \ --branch optimize/my-run \ --editable src/ lib/
| Flag | Description | Default |
|------|-------------|---------|
| `--metric` | Path to `metric.yaml` (name, evaluation_command, direction) | required |
| `--strategy` | Path to a Markdown hints file for the optimizer agent | — |
| `--backend` | Backend to use | `claude-agent-sdk` |
| `--team` | Team profile | `research-optimizer` |
| `--budget` | Total USD budget across all experiments | `10.0` |
| `--max-experiments` | Max iterations | `50` |
| `--branch` | Git branch for winning commits | `optimize/karpathy-loop` |
| `--editable` | Paths the agent may edit (informational; enforced via prompt) | `src/` |
Results land in `logs/experiments.jsonl` inside the workspace. See [`demos/06_karpathy_optimization/`](demos/06_karpathy_optimization/) for a ready-to-run example and [`docs/EVALS.md`](docs/EVALS.md) for eval methodology.
Example excerpt:
md
| Variable | Description | Default |
|---|---|---|
OPENROUTER_API_KEY | OpenRouter API key (CrewAI / LangGraph backends) | — |
ANTHROPIC_API_KEY | Anthropic API key (Claude Agent SDK backend) | — |
AI_TEAM_ENV | Tier: dev, test, prod | dev |
AI_TEAM_BACKEND | Default backend: crewai, langgraph, claude-agent-sdk | crewai |
AI_TEAM_LANGGRAPH_POSTGRES_URI | Postgres URI for LangGraph checkpointing (optional) | SQLite |
OPENROUTER_API_BASE | OpenRouter endpoint | https://openrouter.ai/api/v1 |
OPENROUTER_EMBEDDING_MODEL | Embedding model for crew memory | openai/text-embedding-3-small |
GUARDRAIL_MAX_RETRIES | Max guardrail retries | 3 |
CODE_QUALITY_MIN_SCORE | Min quality score (0–1) | 0.7 |
TEST_COVERAGE_MIN | Min test coverage (0–1) | 0.6 |
MAX_FILE_SIZE_KB | Max file size for tools (KB) | 500 |
OPTIMIZER_MAX_EXPERIMENTS | Default max iterations for AutoOptimizer | 50 |
OPTIMIZER_BUDGET_USD | Default total budget (USD) | 10.0 |
OPTIMIZER_TIMEOUT_PER_EXPERIMENT | Per-experiment timeout (seconds) | 300 |
OPTIMIZER_MIN_IMPROVEMENT_PCT | Min improvement % to keep a commit | 0.5 |
OPTIMIZER_MAX_BUDGET_PER_EXPERIMENT_USD | Per-experiment budget cap | 1.0 |
OPTIMIZER_MAX_TURNS_PER_EXPERIMENT | Max agent turns per experiment | 40 |
OPTIMIZER_DEFAULT_BACKEND | Default backend for optimizer | claude-agent-sdk |
Copy .env.example to .env and set the API key for your chosen backend. Before each run, a pre-flight check validates configured models. Agent→model mapping and guardrail behavior are documented in docs/AGENTS.md and docs/GUARDRAILS.md.
Model IDs are in openrouter/<provider>/<model> form (see src/ai_team/config/models.py). Set AI_TEAM_ENV to dev, test, or prod to choose a tier.
| Role | dev | test | prod |
|---|---|---|---|
| Manager | deepseek/deepseek-chat-v3-0324 | google/gemini-3-flash-preview | anthropic/claude-sonnet-4 |
| Product Owner | deepseek/deepseek-chat-v3-0324 | google/gemini-3-flash-preview | openai/gpt-5.2 |
| Architect | deepseek/deepseek-chat-v3-0324 | deepseek/deepseek-r1-0528 | anthropic/claude-sonnet-4 |
| Backend Developer | mistralai/devstral-2512 | minimax/minimax-m2 | openai/gpt-5.3-codex |
| Frontend Developer | mistralai/devstral-2512 | minimax/minimax-m2 | anthropic/claude-sonnet-4 |
| Fullstack Developer | mistralai/devstral-2512 | minimax/minimax-m2 | openai/gpt-5.3-codex |
| Cloud Engineer | deepseek/deepseek-chat-v3-0324 | deepseek/deepseek-r1-0528 | anthropic/claude-sonnet-4 |
| DevOps | mistralai/devstral-2512 | mistralai/devstral-2512 | openai/gpt-5.3-codex |
| QA Engineer | deepseek/deepseek-chat-v3-0324 | deepseek/deepseek-r1-0528 | anthropic/claude-sonnet-4 |
Embeddings (crew memory) use OPENROUTER_EMBEDDING_MODEL (default: openai/text-embedding-3-small). Current IDs and pricing: OpenRouter models.
The CLI has two top-level subcommands: run (build a project) and optimize (AutoOptimizer Loop).
```bash
poetry run ai-team --backend claude-agent-sdk --team backend-api "Create a REST API for a todo list" ```
For step-by-step setup and troubleshooting, see docs/GETTING_STARTED.md.
A production-grade browser UI with real-time WebSocket streaming, GitHub-dark theme, and side-by-side backend comparison.
```bash
The multi-agent framework landscape is moving fast. CrewAI, LangGraph, Claude Agent SDK, AutoGen, AWS Bedrock Agents — each makes different trade-offs around orchestration control, state management, human-in-the-loop, persistence, streaming, and cost. Rather than pick one and hope, this project runs the same team through multiple backends and lets the data decide.
Run the same demo through multiple backends and compare:
python scripts/compare_backends.py demos/01_hello_world --env dev
python scripts/compare_backends.py demos/01_hello_world --env dev --with-claude
Produces a side-by-side report: output quality, cost, latency, token usage, error rate. Use --with-claude to include the Claude Agent SDK (requires ANTHROPIC_API_KEY).
该项目提供了一个开源的AI工作流系统,支持自然语言请求转换和多agent协同开发,值得关注
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。
经综合评估,开源AI工作流 在Agent工作流赛道中表现稳健,质量良好。如果你已有明确的使用需求,可以直接上手体验;如果还在评估阶段,建议对比同类工具后再做决策。
| 原始名称 | ai-team |
| 原始描述 | 开源AI工作流:Autonomous multi-agent software development system — converts natural language r。⭐13 · Python |
| Topics | workflowpython |
| GitHub | https://github.com/RickZee/ai-team |
| License | MIT |
| 语言 | Python |
收录时间:2026-05-22 · 更新时间:2026-05-23 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端