能力标签

🔌 MCP 🤖 Agent 🔄 工作流 👁 OCR 🌐 翻译 🐳 Docker 💻 CLI 🔗 REST API 🧬 Embedding 🖼 视觉

🔌

MCP工具

沙堡

基于 Python · 让 AI 助手直接操作你的系统与工具

英文名：Sandcastle

⭐ 71 Stars 🍴 6 Forks 💻 Python 📄 NOASSERTION 🏷 AI 8.0分

8.0AI 综合评分

aimcpcompliance

⚙️ 配置说明

✦ AI Skill Hub 推荐

沙堡是 AI Skill Hub 本期精选MCP工具之一。综合评分 8.0 分，整体质量较高。我们强烈推荐将其纳入你的 AI 工具库，帮助提升工作效率。

📚 深度解析

沙堡是一款基于 MCP（Model Context Protocol）标准协议的 AI 工具扩展。MCP 协议由 Anthropic 开发并开源，旨在建立 AI 模型与外部工具之间的标准化通信接口，目前已被 Claude Desktop、Claude Code、Cursor 等主流 AI 工具采纳。

通过安装沙堡，你的 AI 助手将获得额外的工具调用能力，可以用自然语言直接操控该工具的功能，无需学习复杂的命令行语法。MCP 工具的核心价值在于"一次配置，永久增强"——配置完成后，每次与 AI 对话时都可以无缝调用这些工具。

在技术实现上，MCP 工具通过标准的 JSON-RPC 协议与 AI 客户端通信，工具的功能以"工具列表"的形式暴露给 AI 模型，AI 可以按需调用。沙堡提供了结构化的工具调用接口，使 AI 模型能够精确地理解和使用每个功能点，显著降低 AI 在工具使用上的错误率。

与传统的 API 集成相比，MCP 工具的优势在于无需编写代码——用户只需在配置文件中添加几行 JSON，即可让 AI 获得全新能力。AI Skill Hub 将沙堡评为 AI 评分 8.0 分，属于同类工具中的优质选择。

📋 工具概览

沙堡是一款遵循 MCP（Model Context Protocol）标准协议的 AI 工具扩展。通过 MCP 协议，它可以让 Claude、Cursor 等主流 AI 客户端直接访问和操作外部工具、数据源和服务，实现 AI 能力的无缝扩展。无论是文件操作、数据库查询还是 API 调用，都可以通过自然语言在 AI 对话中直接触发，极大提升生产效率。

GitHub Stars

⭐ 71

开发语言

Python

支持平台

Windows / macOS / Linux

维护状态

轻量级项目，按需更新

开源协议

NOASSERTION

AI 综合评分

8.0 分

工具类型

MCP工具

Forks

📖 中文文档

以下内容由 AI Skill Hub 根据项目信息自动整理，如需查看完整原始文档请访问底部「原始来源」。

📌 核心特色

通过标准 MCP 协议与 Claude、Cursor 等主流 AI 客户端深度集成
提供结构化工具调用接口，显著降低 AI 集成复杂度
支持 Claude Desktop 和 Claude Code 无缝接入，开箱即用
可与其他 MCP 工具组合叠加，构建完整 AI 工作站
轻量无侵入设计，不影响现有系统架构

🎯 主要使用场景

在 Claude Desktop 对话中直接调用本地工具，实现 AI 与系统的深度联动
通过自然语言驱动复杂的多步骤自动化任务，代替繁琐手动操作
将多个 MCP 工具组合使用，构建个人专属 AI 工作站

以下安装命令基于项目开发语言和类型自动生成，实际以官方 README 为准。

安装命令

# 方式一：通过 Claude Code CLI 一键安装
claude skill install https://github.com/gizmax/Sandcastle

# 方式二：手动配置 claude_desktop_config.json
{
  "mcpServers": {
    "--": {
      "command": "npx",
      "args": ["-y", "sandcastle"]
    }
  }
}

# 配置文件位置
# macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
# Windows: %APPDATA%/Claude/claude_desktop_config.json

📋 安装步骤说明

确认已安装 Node.js（v18 或以上版本）
打开 Claude Desktop 或 Claude Code 的 MCP 配置文件
按「交给 Agent 安装 → Claude Desktop」标签中的 JSON 配置填入 mcpServers 字段
保存配置文件并重启 Claude 客户端
重启后，在对话中即可使用本工具

以下用法示例由 AI Skill Hub 整理，涵盖最常见的使用场景。

常用命令 / 代码示例

# 安装后在 Claude 对话中直接使用
# 示例：
用户: 请帮我用 沙堡 执行以下任务...
Claude: [自动调用 沙堡 MCP 工具处理请求]

# 查看可用工具列表
# 在 Claude 中输入："列出所有可用的 MCP 工具"

以下配置示例基于典型使用场景生成，具体参数请参照官方文档调整。

配置示例

// claude_desktop_config.json 配置示例
{
  "mcpServers": {
    "__": {
      "command": "npx",
      "args": ["-y", "sandcastle"],
      "env": {
        // "API_KEY": "your-api-key-here"
      }
    }
  }
}

// 保存后重启 Claude Desktop 生效

📑 README 深度解析真实文档完整度 76/100 含工作流图查看 GitHub 原文 →

以下内容由系统直接从 GitHub README 解析整理，保留代码块、表格与列表结构。

Sandcastle

Describe what you want. Go home. Sandcastle ships it. Production-ready workflow orchestrator for AI agents. 7 AI providers with auto-failover, 22 step types including Claude Managed Agents, 15 agent templates, 4 OCR engines, EU AI Act compliance, and a full-featured dashboard. Define workflows in YAML or let AI design them for you.

v0.32.2 - "Your Sandbox, Your Silicon" Shipped May 19, 2026. Anthropic gave you a model. We gave you a place to put it. - Self-hosted sandboxes on the four blog partners + Docker reference (cloudflare, daytona, modal, vercel, docker). Five cookbooks under deploy/cookbooks/. Your org key never leaves your boundary - the worker refuses to start if ANTHROPIC_API_KEY is present in the sandbox env, only sk-ant-oat01- environment-scoped keys get past the gate. - Memory MCP server. Anthropic declared memory_stores incompatible with self-hosted sandboxes. We filled the gap: sandcastle.engine.memory_mcp_server wraps mem0 + persistent Qdrant + Haiku decisions. Four tools (add, search, forget, list_memories), two resources, one prompt. Helm chart + docker-compose included. - MCP tunnels (gated beta, header mcp-client-2025-11-20). WIF token exchange so your private Jira / Snowflake / Confluence is reachable from a managed agent with no static secrets. cloudflared sidecar pattern documented end to end. - Live work-queue dashboard. WorkQueuePanel on the runs page, SSE-driven depth + sparkline + pill (green < 5, amber 5-50, red > 50), aria-live polite. You see the backlog before the pager fires. - Webhook workers. New session.status_run_started event lets a worker run as a webhook handler instead of long-polling - saves RAM, latency, and your AWS bill. - Three production case-study workflows at workflows/case-studies/ - Amplitude designer, Clay GTM, Rogo analyst on private data. Not tutorials. Blueprints. - 150 new tests validated across 6 iterations (sequential / isolation / random + pollution / stress-repeat / smoke / consolidation). Zero regressions. PyPI: pip install sandcastle-ai==0.32.2. Previous: v0.32.0 - "Claude Agents Deep Integration" (May 16, 2026): Memory Stores, Multiagent, Outcomes, Webhooks, Skills Publisher, Trajectory Replay, Agent SDK runtime, Computer Use, MCP Elicitation, Live Agent Reasoning panel.

<a href="https://gizmax.github.io/Sandcastle/">Try the Live Demo (no backend needed)</a>

---

Features

Capability
Pluggable sandbox backends (E2B, Docker, Local, Cloudflare)	Yes
Multi-provider model routing (Claude, OpenAI, MiniMax, Google/Gemini, Mistral, Ollama, oMLX)	Yes
62 built-in integrations across 9 categories	Yes
22 step types (standard, llm, http, code, race, sensor, gate, parse, managed-agent...)	Yes
Zero-config local mode	Yes
DAG workflow orchestration	Yes
Parallel step execution	Yes
Run Time Machine (replay/fork)	Yes
Budget guardrails	Yes
Run cancellation	Yes
Idempotent run requests	Yes
Persistent storage (S3/MinIO)	Yes
Webhook callbacks (HMAC-signed)	Yes
Scheduled / cron agents	Yes
Retry logic with exponential backoff	Yes
Dead letter queue with full replay	Yes
Per-run cost tracking	Yes
SSE live streaming	Yes
Multi-tenant API keys	Yes
Python SDK + async client	Yes
CLI tool	Yes
MCP server (Claude Desktop, Cursor, Windsurf)	Yes
Docker one-command deploy	Yes
Dashboard with real-time monitoring	Yes
127 built-in workflow templates	Yes
118 community templates (Community Hub)	Yes
Visual workflow builder	Yes
Directory input (file processing)	Yes
CSV export per step	Yes
Human approval gates	Yes
Self-optimizing workflows (AutoPilot)	Yes
Hierarchical workflows (workflow-as-step)	Yes
Policy engine (PII redaction, secret guard)	Yes
Privacy router (PII redaction, 7 patterns)	Yes
Pre-run cost estimation (`POST /runs/estimate`)	Yes
Cost-latency optimizer (SLO-based routing)	Yes
EU AI Act compliance (risk classification, transparency reports, Annex IV)	Yes
Tamper-evident audit trail (SHA-256 hash chain)	Yes
OpenTelemetry instrumentation (workflow + step spans)	Yes
Browser modes (LightPanda headless, Browserbase cloud)	Yes
Concurrency control (rate limiter, semaphores)	Yes
Agent memory (semantic search, decay, conflict detection)	Yes
Evaluations (test suites, assertions, pass rate tracking)	Yes
Credential encryption (Fernet AES-128-CBC)	Yes
API key rotation + IP allowlisting	Yes
Security headers + CSP	Yes
Distributed rate limiting (in-memory + Redis)	Yes
A2A protocol (Google Agent-to-Agent)	Yes
AG-UI protocol (CopilotKit SSE streaming)	Yes
Guided onboarding wizard	Yes
Global search (runs, workflows, integrations)	Yes
Health insights (system health score + per-page banners)	Yes
License key system (community / pro / enterprise tiers)	Yes

---

Install and start PostgreSQL (macOS example)

brew install postgresql@16 brew services start postgresql@16

Install and start Redis (macOS example)

brew install redis brew services start redis

Production Mode - Docker (recommended)

One command. PostgreSQL, Redis, API server, and background worker - all configured.

```bash git clone https://github.com/gizmax/Sandcastle.git cd Sandcastle

Interactive setup wizard (API keys, .env, workflows/)

sandcastle init

Community Hub - browse and install templates

sandcastle hub search "lead scoring" sandcastle hub install competitive-radar sandcastle hub collections sandcastle hub install-collection marketing-pro

Install a community template

sandcastle hub install competitive-radar

Install a whole collection

sandcastle hub collections sandcastle hub install-collection marketing-pro

Docker Sandbox Hardening

When using the Docker backend, every container runs with: - All capabilities dropped (CapDrop: ALL) - Seccomp profile restricting dangerous syscalls - PID limit (default 100, configurable) - CPU quota (default 50%, configurable) - Memory limit (default 512 MiB, configurable) - Unprivileged user (1000:1000) - Auto-remove on exit

---

Quickstart

Example: lead-enrichment.yaml

name: "Lead Enrichment"
description: "Scrape, enrich, and score leads for sales outreach."
default_model: sonnet
default_max_turns: 10
default_timeout: 300

steps:
  - id: "scrape"
    prompt: |
      Visit {input.target_url} and extract:
      company name, employee count, main product, contact info.
      Return as structured JSON.
    output_schema:
      type: object
      properties:
        company_name: { type: string }
        employees: { type: integer }
        product: { type: string }
        contact_email: { type: string }

  - id: "enrich"
    depends_on: ["scrape"]
    prompt: |
      Given this company data: {steps.scrape.output}
      Research: revenue, industry, key decision makers, recent news.
    retry:
      max_attempts: 3
      backoff: exponential
      on_failure: abort

  - id: "score"
    depends_on: ["enrich"]
    prompt: |
      Score this lead 1-100 for B2B SaaS potential.
      Based on: {steps.enrich.output}
    model: haiku

on_complete:
  storage_path: "leads/{run_id}/result.json"

Templates (Quickstart)

15 built-in agent templates across 6 categories:

Category	Template	Role
Research	`researcher`	Web research with citations and source verification
Research	`seo_specialist`	SEO audit, keywords, meta tags, structured data
Development	`coder`	Python development - writes and runs code
Development	`reviewer`	Code review, security audit, best practices
Development	`tester`	Pytest tests, coverage, fixtures
Development	`devops`	Dockerfiles, CI/CD, deployment scripts
Development	`designer`	HTML/CSS/Tailwind prototypes, SVG
Data	`analyst`	Pandas, matplotlib, statistical analysis
Data	`sql_expert`	SQL optimization, schema design, migrations
Data	`scraper`	Web scraping, BeautifulSoup, data extraction
Content	`writer`	Content writing, proofreading, editing
Content	`translator`	Translation with cultural context, 90+ languages
Business	`legal_analyst`	Contract analysis, risk identification, clause extraction
Business	`financial_analyst`	Financial models, charts, forecasting
Operations	`project_manager`	Project breakdown, Gantt charts, status reports

Advanced features: output_format (json/files/markdown), shared_files between agents, fallback_template on failure.

```yaml

Add to .env

echo 'DATABASE_URL=postgresql+asyncpg://localhost/sandcastle' >> .env

Add to .env

echo 'REDIS_URL=redis://localhost:6379' >> .env

Add to .env

echo 'STORAGE_BACKEND=s3' >> .env echo 'S3_BUCKET=sandcastle-artifacts' >> .env echo 'AWS_ACCESS_KEY_ID=...' >> .env echo 'AWS_SECRET_ACCESS_KEY=...' >> .env

Set DATABASE_URL and REDIS_URL in .env

Set in .env or via sandcastle init

SANDBOX_BACKEND=e2b # default SANDBOX_BACKEND=docker # requires Docker + pip install sandcastle-ai[docker] SANDBOX_BACKEND=local # dev only, no isolation SANDBOX_BACKEND=cloudflare # requires deployed CF Worker ```

All backends share the same SandboxBackend protocol - same YAML, same API, same dashboard. Switch backends without changing workflows.

Docker hardening: When using the Docker backend, containers run with all capabilities dropped, a seccomp profile restricting syscalls, PID limits (default 100), CPU quotas (default 50%), memory limits (default 512 MiB), and an unprivileged user (1000:1000). All configurable via environment variables.

---

A) Template - one line to configure

steps: - id: deep-research type: managed-agent managed_agent_config: agent_template: researcher message: "Research {input.topic} and provide a comprehensive report with citations"

Per-workflow configuration

privacy: mode: redact # "redact" | "audit_only" patterns: # optional: restrict to specific patterns - email - credit_card - ssn exclude_steps: # optional: skip privacy check on these steps - internal-analysis

bash

Per-server configuration (env vars)

PRIVACY_MODE=redact PRIVACY_PATTERNS=email,phone,ssn,credit_card,ip,iban,dob ```

The Privacy Router integrates with the audit trail - every redaction event is logged with run ID, step ID, and matched pattern type (not the matched value).

---

Enable EU AI Act enforcement in .env

COMPLIANCE_MODE=eu_ai_act


Check active compliance features:

bash GET /api/compliance/status ```

Configured via environment variables

MEMORY_BACKEND=local # "local" (SQLite + embeddings) or "cloud" (Mem0) MEMORY_GRAPH_ENABLED=false # Enable Neo4j graph backend MEMORY_MAX_AGE_DAYS=90 # TTL for memory decay (0 = keep forever) MEMORY_ADMIT_THRESHOLD=0.3 # Minimum quality score for admission ```

---

Restart API + start a worker in a second terminal

sandcastle serve sandcastle worker


With Redis, workflows run in background workers instead of in-process. You can run multiple workers for parallel execution.

**Step 3 - S3 / MinIO** (artifact storage)

bash

For MinIO, also set: S3_ENDPOINT_URL=http://localhost:9000

Add your API keys

cat > .env << 'EOF' ANTHROPIC_API_KEY=sk-ant-... E2B_API_KEY=e2b_... SANDBOX_BACKEND=e2b WEBHOOK_SECRET=your-signing-secret EOF

docker compose up -d


That's it. Sandcastle is running at `http://localhost:8080` with PostgreSQL 16, Redis 7, auto-migrations, and an arq background worker.

bash docker compose ps # check status docker compose logs -f # tail logs docker compose down # stop everything ```

Start the API server (serves API + dashboard on one port)

uv run python -m sandcastle serve

Python SDK

Install from PyPI and use Sandcastle programmatically from any Python app:

pip install sandcastle-ai

```python from sandcastle import SandcastleClient

client = SandcastleClient(base_url="http://localhost:8080", api_key="sc_...")

Start the server (API + dashboard on one port)

sandcastle serve

Streaming CLI (sandcastle run --stream)

The sandcastle run --stream flag enables live terminal output with color-coded status: green for completed steps, yellow for running, red for failed. Each step shows timing, cost, and responsibility. Pressing Ctrl+C detaches from the stream and lets the workflow continue in background. Three additional commands ship in this release: sandcastle describe, sandcastle lint, and sandcastle owners.

---

Cost Estimation API

Estimate the cost of a workflow before running it. The /runs/estimate endpoint parses the workflow YAML, resolves model assignments per step (including classify/gate overrides), and returns a per-step and total cost breakdown based on average token usage.

curl -X POST http://localhost:8080/api/runs/estimate \
  -H "Content-Type: application/json" \
  -d '{
    "workflow": "lead-enrichment",
    "input": { "target_url": "https://example.com" }
  }'

{
  "data": {
    "valid": true,
    "validation_errors": [],
    "estimated_cost_usd": 0.18,
    "steps": [
      { "id": "scrape",  "model": "sonnet", "estimated_cost_usd": 0.06 },
      { "id": "enrich",  "model": "sonnet", "estimated_cost_usd": 0.09 },
      { "id": "score",   "model": "haiku",  "estimated_cost_usd": 0.03 }
    ]
  }
}

The valid field indicates whether the workflow passes validation. Invalid workflows still return an estimate but include a disclaimer that the figure may be unreliable. Falls back to sonnet pricing for unknown models.

---

Audit Endpoints

Method	Endpoint	Description
`GET`	`/api/audit`	Paginated audit log (filterable by run, event type, date)
`GET`	`/api/runs/{id}/audit`	Audit events for a specific run
`GET`	`/api/audit/verify/{id}`	Verify hash chain integrity for an event

```bash

API Key Rotation

Rotate API keys with zero downtime. The old key remains valid during a configurable grace period (default 24 hours), giving clients time to switch over.

```bash

Your First Workflow

```bash

Run a workflow asynchronously

curl -X POST http://localhost:8080/api/workflows/run \ -H "Content-Type: application/json" \ -d '{ "workflow": "lead-enrichment", "input": { "target_url": "https://example.com", "max_depth": 3 }, "callback_url": "https://your-app.com/api/done" }'

Run a workflow and wait for completion

run = client.run("lead-enrichment", input={"target_url": "https://example.com"}, wait=True, ) print(run.status) # "completed" print(run.total_cost_usd) # 0.12 print(run.outputs) # {"lead_score": 87, "tier": "A", ...}

Stream live events from a running workflow

for event in client.stream(run.run_id): print(event)

Run a workflow

sandcastle run lead-enrichment -i target_url=https://example.com

List runs, workflows, schedules

sandcastle ls runs --status completed --limit 10 sandcastle ls workflows sandcastle ls schedules

Cancel a running workflow

sandcastle cancel <run-id>

MCP Integration

Sandcastle ships with a built-in MCP (Model Context Protocol) server. This lets Claude Desktop, Cursor, Windsurf, and any MCP-compatible client interact with Sandcastle directly from the chat interface - run workflows, check status, manage schedules, browse results.

flowchart LR Client["Claude Desktop\nCursor / Windsurf"] MCP["sandcastle mcp\n(MCP server)"] API["localhost:8080\n(sandcastle serve)"] Client -->|stdio| MCP -->|HTTP| API

Install the MCP extra:

pip install sandcastle-ai[mcp]

Available MCP Tools

Tool	Description
`run_workflow`	Run a saved workflow by name with optional input data and wait mode
`run_workflow_yaml`	Run a workflow from inline YAML definition
`get_run_status`	Get detailed run status including all step results
`cancel_run`	Cancel a queued or running workflow
`list_runs`	List runs with optional status and workflow filters
`save_workflow`	Save a workflow YAML definition to the server
`create_schedule`	Create a cron schedule for a workflow
`delete_schedule`	Delete a workflow schedule

Available MCP Resources

URI	Description
`sandcastle://workflows`	Read-only list of all available workflows
`sandcastle://schedules`	Read-only list of all active schedules
`sandcastle://health`	Server health status (sandbox backend, DB, Redis)

Client Configuration

Claude Desktop - add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "sandcastle": {
      "command": "sandcastle",
      "args": ["mcp"],
      "env": {
        "SANDCASTLE_URL": "http://localhost:8080",
        "SANDCASTLE_API_KEY": "sc_..."
      }
    }
  }
}

Cursor - add to .cursor/mcp.json in your project root:

{
  "mcpServers": {
    "sandcastle": {
      "command": "sandcastle",
      "args": ["mcp", "--url", "http://localhost:8080"]
    }
  }
}

Windsurf - add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "sandcastle": {
      "command": "sandcastle",
      "args": ["mcp"]
    }
  }
}

The MCP server uses stdio transport (spawned as a child process by the client). It requires a running sandcastle serve instance to connect to. Connection is configured via --url / --api-key CLI args or SANDCASTLE_URL / SANDCASTLE_API_KEY env vars.

What You Can Do from Chat

Once connected, ask your AI assistant to:

"Run the lead-enrichment workflow for https://example.com"
"What's the status of my last run?"
"List all failed runs from today"
"Create a schedule to run data-sync every day at 9am"
"Cancel run abc-123"
"Save this workflow YAML to the server"
"Show me all available workflows"
"Check if Sandcastle is healthy"

---

62 Built-in Integrations

Sandcastle ships with 62 zero-config tool connectors across 9 categories. Each integration is a lightweight JavaScript module that agents can call during workflow execution. Named connections let you wire multiple accounts (e.g. "production-slack" vs "staging-slack"), and all credentials are encrypted at rest with Fernet (AES-128-CBC + HMAC-SHA256).

Category	Tools
Communication	Slack, Microsoft Teams, Discord, Twilio, SendGrid, Resend, WhatsApp
Project Management	Jira, Linear, Notion, Airtable, Google Sheets, Figma
CRM	HubSpot, Salesforce, Zendesk, Intercom
Data	MongoDB, Snowflake, Supabase, Pinecone, Redis, Database, Google Drive, Qdrant, GCS, Azure Blob
ERP	SAP, ServiceNow, Helios, ABRA
Payments	Stripe, Shopify, QuickBooks, Plaid, DocuSign
AI	OpenAI, Anthropic, ElevenLabs, Langfuse
DevOps	GitHub, AWS S3, Vercel, Cloudflare Workers, Datadog, PagerDuty
General	Webhook, Zapier, Calendly, Firecrawl, Tavily, Exa, MCP Bridge, Human Input, Filesystem, Shell, Python Runtime, Code Interpreter, Browser

steps:
  - id: "notify"
    type: notify
    service: slack
    connection: production-slack
    template: "Lead {steps.score.output.company} scored {steps.score.output.score}/100"

---

Workflow Engine

Define multi-step agent pipelines as YAML. Each step can run in parallel, depend on previous steps, pass data forward, and use different models.

Self-Describing Workflows

Every step can declare responsibility, source_hint, owner, and added_date metadata so your workflows stay understandable as they grow. Three new CLI commands support this: sandcastle describe prints a human-readable summary of any workflow, sandcastle lint flags missing metadata and structural issues, and sandcastle owners lists who owns each step. The audit trail and dashboard are enriched with this metadata automatically.

Self-Optimizing Workflows (AutoPilot)

A/B test different models, prompts, and configurations for any step. Sandcastle automatically runs variants, evaluates quality (via LLM judge or schema completeness), tracks cost and latency, and picks the best-performing variant. Supports quality, cost, latency, and pareto optimization targets.

steps:
  - id: "enrich"
    prompt: "Enrich this lead: {input.company}"
    autopilot:
      enabled: true
      optimize_for: quality
      min_samples: 20
      auto_deploy: true
      variants:
        - id: fast
          model: haiku
        - id: quality
          model: opus
          prompt: "Thoroughly research and enrich: {input.company}"
      evaluation:
        method: llm_judge
        criteria: "Rate completeness, accuracy, and depth 1-10"

---

Hierarchical Workflows (Workflow-as-Step)

Call one workflow from another. Parent workflows can pass data to children via input mapping, collect results via output mapping, and fan out over lists with configurable concurrency. Depth limiting prevents runaway recursion.

steps:
  - id: "find-leads"
    prompt: "Find 10 leads in {input.industry}"

  - id: "enrich-each"
    type: sub_workflow
    depends_on: ["find-leads"]
    sub_workflow:
      workflow: lead-enrichment
      input_mapping:
        company: steps.find-leads.output.company
      output_mapping:
        result: enriched_data
      max_concurrent: 5
      timeout: 600

  - id: "summarize"
    depends_on: ["enrich-each"]
    prompt: "Summarize enrichment results: {steps.enrich-each.output}"

---

Workflow Version Diff

Every workflow edit is versioned. Compare any two versions side-by-side with YAML diff highlighting. See exactly what changed, when, and roll back if needed.

🎯 aiskill88 AI 点评 A 级 2026-06-01

Sandcastle是一个高质量的开源MCP工具

📚 实用指南（长尾问题）

适合谁