经 AI Skill Hub 精选评估,Castor AI 工作流 获评「推荐使用」。这款Agent工作流在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 7.5 分,适合有一定技术背景的用户使用。
Castor AI 工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
Castor AI 工作流 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:pip 安装(推荐)
pip install castor
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install castor
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/deepfounder-ai/castor
cd castor
pip install -e .
# 验证安装
python -c "import castor; print('安装成功')"
# 命令行使用
castor --help
# 基本用法
castor input_file -o output_file
# Python 代码中调用
import castor
# 示例
result = castor.process("input")
print(result)
# castor 配置文件示例(config.yml) app: name: "castor" debug: false log_level: "INFO" # 运行时指定配置文件 castor --config config.yml # 或通过环境变量配置 export CASTOR_API_KEY="your-key" export CASTOR_OUTPUT_DIR="./output"
<p align="center"> <pre> ██████╗ █████╗ ███████╗████████╗ ██████╗ ██████╗ ██╔════╝██╔══██╗██╔════╝╚══██╔══╝██╔═══██╗██╔══██╗ ██║ ███████║███████╗ ██║ ██║ ██║██████╔╝ ██║ ██╔══██║╚════██║ ██║ ██║ ██║██╔══██╗ ╚██████╗██║ ██║███████║ ██║ ╚██████╔╝██║ ██║ ╚═════╝╚═╝ ╚═╝╚══════╝ ╚═╝ ╚═════╝ ╚═╝ ╚═╝ </pre> </p>
<p align="center"> Self-hosted AI agent ready to drop into business workflows. Bring any OpenAI-compatible LLM — Azure OpenAI, AWS Bedrock, OpenAI, Groq, OpenRouter, or a local model on your own hardware. Your data, your provider, your rules. </p>
<p align="center"> <a href="#quick-start">Quick Start</a> • <a href="#interfaces">Interfaces</a> • <a href="docs/README.md"><b>Documentation</b></a> • <a href="#tools">Tools</a> • <a href="#skills">Skills</a> • <a href="#telegram-bot">Telegram</a> • <a href="#diagnostics">Doctor</a> </p>
<p align="center"> <img src="https://img.shields.io/badge/version-0.23.2-blue" alt="version"> <img src="https://img.shields.io/badge/python-3.11+-green" alt="python"> <img src="https://img.shields.io/badge/platform-linux%20%7C%20macos%20%7C%20windows-lightgrey" alt="platform"> <img src="https://img.shields.io/badge/license-MIT-orange" alt="license"> <a href="https://t.me/castor_ai"><img src="https://img.shields.io/badge/community-Telegram-blue?logo=telegram" alt="Telegram"></a> </p>
---
Castor's design principle: the system around the LLM does the heavy lifting, so the agent stays reliable on small local models and cheap on large hosted ones. Each feature below has a deep-dive guide in docs/.
Tool Search — a meta-tool architecture that keeps the prompt lean. Only ~8 core tools load by default (memory_search, memory_save, read_file, write_file, shell, http_request, spawn_task, tool_search); the model calls tool_search("browser") / "schedule" / "secret" / … to activate the rest on demand. Saves ~75% of the tokens a flat 49-tool list would burn.
Memory & Knowledge Graph — 3-layer system in one Qdrant collection: raw facts (saved instantly) → entities with typed relations → wiki summaries (both built by a nightly synthesis job). Hybrid retrieval fuses dense (FastEmbed MiniLM, 384d, 50+ languages) + sparse (SPLADE++) + BM25 via RRF. Thread-isolated, auto-chunked, secret-scrubbed. Interactive force-directed graph in the Web UI. → docs/MEMORY.md
Knowledge ingest — 50+ formats via Microsoft MarkItDown: PDF / DOCX / PPTX / XLSX / EPUB / HTML / code / data / images. Drop files, paste a URL, or scan a folder. Chunked, embedded, and queued for entity + wiki synthesis. → docs/KNOWLEDGE.md
Skills — pluggable single-file Python modules. Nine built in (browser, canvas, serial_port, mcp_manager, skill_creator, soul_editor, notes, timer, weather); create new ones from chat (skill_creator runs a plan→code→validate pipeline) or import from the agentskills.io spec. → docs/SKILLS.md · docs/SKILLS_IMPORT.md
Browser — Playwright + Chromium. Navigate, read, click, fill forms, screenshot. Headless by default; visible mode for logged-in sessions and OAuth flows. → docs/BROWSER.md
Hardware — the serial_port skill talks USB-serial / RS-232 / RS-485 to scales, barcode/RFID readers, GPS, label & receipt printers, PLCs (Modbus RTU), VFDs, and sensors. Cross-platform via pyserial. Actuator writes are gated behind an explicit confirm=true with a hex preview. → docs/HARDWARE.md
Canvas — render model-supplied HTML in a sandboxed 480px side panel: blocking forms that return submitted data, saveable dashboards, throwaway mockups. Iframe is sandbox="allow-scripts allow-forms" with no allow-same-origin, so generated HTML can't read parent state. → docs/CANVAS.md
Routines — scheduled tasks that live as chat threads: each firing appends a turn, and corrections you add between runs become context for the next. Natural schedule syntax (every 2h, weekdays 09:00, mon,wed,fri 14:30). Per-routine USD budget caps. → docs/ROUTINES.md
Goals — long-running autonomous tasks. A durable SQLite queue + worker daemon survives disconnects and restarts; an orchestrator breaks the goal into subtasks, dispatches specialized subagents, and an acceptance gate validates deliverables before marking done. → docs/GOALS.md
MCP — connect external Model Context Protocol tool servers (stdio or HTTP). Tools surface as mcp__server__tool and flow through tool_search. Manage via chat (mcp_manager) or Settings. → docs/MCP.md
Providers — any OpenAI-compatible endpoint (LM Studio, Ollama, OpenAI, OpenRouter, Groq, Together, DeepSeek, + more) plus a native Anthropic adapter for prompt caching & thinking budgets. Switch per-thread via /model or Settings. → docs/PROVIDERS.md
Voice & Camera — live voice mode (VAD → STT → LLM → TTS → auto-listen), local or API STT, multiple TTS backends; camera capture via browser PiP or OpenCV. → docs/VOICE.md · docs/CAMERA.md
Personality (Soul) — 8 adjustable traits (humor, honesty, curiosity, brevity, formality, proactivity, empathy, creativity) plus custom traits, agent name, and language. Edit via /soul, Settings, or chat. → docs/SOUL.md
Cost tracking — every LLM call records tokens + USD by thread, source, model, and provider, with LiteLLM-backed pricing. Surfaced in the Web UI. → docs/COST_TRACKING.md
The reliability internals that keep all of this working on small models — JSON repair, anti-hedge nudging, self-check, loop detection, compaction, auto-resume — are described under Engineering around the LLM above.
CASTOR_LLM_URL + CASTOR_LLM_KEY and you're done.For hosted-LLM deployments, Castor itself is light — any modern laptop or small VM works (the agent process is ~300MB resident, plus Qdrant on disk for memory).
For local-LLM deployments where the model runs on the same machine:
| Component | Minimum | Recommended |
|---|---|---|
| GPU | 4GB VRAM (4B Q4) | 8GB VRAM (9B Q4_K_M) or larger |
| RAM | 8GB | 16GB |
| Storage | 10GB | 20GB (models + memory) |
Works on: gaming laptops, desktop GPUs (RTX 3060+), Mac M1+ (via Ollama), Linux servers.
Runs natively on Linux, macOS (Intel & Apple Silicon) and Windows 10/11 — single pip install -e . pulls every runtime dep (including MarkItDown, python-docx/pptx, openpyxl, pdfminer.six, pypdf, fastembed, qdrant-client, uvicorn).
curl -fsSL https://raw.githubusercontent.com/deepfounder-ai/castor/main/install.sh | bash
This clones the repo, creates a venv, installs everything, verifies critical deps, pre-downloads the embedding model, and drops castor on your $PATH.
git clone https://github.com/deepfounder-ai/castor.git
cd castor
setup.bat
On Windows shell commands are routed through Git Bash (auto-detected at install time — install Git for Windows if missing). Falls back to cmd.exe if not found.
```bash git clone https://github.com/deepfounder-ai/castor.git cd castor
pip install -e .
docker compose up
LM Studio / Ollama should be running on the host. Persistent data in ./data/.
A module-by-module map of the codebase lives in ARCHITECTURE.md.
python3 -m venv .venv # or python -m venv .venv on Windows source .venv/bin/activate # macOS/Linux
Environment variables:
CASTOR_LLM_URL=http://localhost:1234/v1 # LLM server URL
CASTOR_LLM_MODEL=qwen/qwen3.5-9b # Model name
CASTOR_LLM_KEY=lm-studio # API key
CASTOR_DB_PATH=~/.castor/castor.db # Database path
CASTOR_DATA_DIR=~/.castor # Where threads / memory / uploads live
CASTOR_QDRANT_MODE=disk # memory | disk | server
CASTOR_PASSWORD= # Web UI password (shows login modal if set)
CASTOR_STT_DEVICE=cpu # STT inference device (cpu | cuda)
Everything else (30+ knobs — context_budget, rag_chunk_size, synthesis_time, tts_api_url, etc.) lives in Settings → Advanced → Settings and persists in SQLite.
castor --web (add --ssl --port 7861 for mic/camera). Single-file SPA, zero runtime JS deps. Chat, memory browser, scheduler, presets, settings, knowledge graph, canvas panel, live voice mode.castor. Rich-formatted chat with 20+ slash commands (/soul, /skills, /memory, /model, /cron, /doctor, …).高质量的开源AI工作流项目,值得关注
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。
AI Skill Hub 点评:Castor AI 工作流 的核心功能完整,质量良好。对于自动化工程师和运维人员来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。
| 原始名称 | castor |
| 原始描述 | 开源AI工作流:A self-hosted AI agent built to drop into business workflows: customer ops, inte。⭐48 · Python |
| Topics | aiagentpythonworkflow |
| GitHub | https://github.com/deepfounder-ai/castor |
| License | MIT |
| 语言 | Python |
收录时间:2026-05-31 · 更新时间:2026-06-01 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端