能力标签
AI工作站
⚙️
Agent工作流

AI工作站

基于 Python · 无代码搭建完整 AI 自动化流程
英文名:guaardvark
⭐ 24 Stars 🍴 15 Forks 💻 Python 📄 MIT 🏷 AI 7.5分
7.5AI 综合评分
aiai-agentsflaskpython
✦ AI Skill Hub 推荐

经 AI Skill Hub 精选评估,AI工作站 获评「推荐使用」。这款Agent工作流在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 7.5 分,适合有一定技术背景的用户使用。

📚 深度解析

AI工作站 是一套完整的 AI Agent 自动化工作流方案。随着 AI 能力的不断提升,基于 Agent 的自动化工作流正在成为提升个人和团队效率的核心方式。区别于传统的 RPA 自动化(模拟鼠标键盘操作),AI Agent 工作流通过理解任务意图、动态规划执行路径,能够处理更复杂的非结构化任务。

AI工作站 工作流的设计遵循"最小配置,最大复用"原则:核心逻辑已经封装好,用户只需配置自己的 API Key 和业务参数即可快速上手。工作流内置错误处理和重试机制,在网络波动或 API 限速等情况下仍能稳定运行,适合作为生产环境的自动化基础设施。

在实际部署时,建议先在测试环境中运行 3-5 次,验证各个环节的输出结果符合预期,再部署到生产环境。AI Skill Hub 评分 7.5 分,是同类 Agent 工作流中的精选推荐。

📋 工具概览

AI工作站 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。

GitHub Stars
⭐ 24
开发语言
Python
支持平台
Windows / macOS / Linux
维护状态
轻量级项目,按需更新
开源协议
MIT
AI 综合评分
7.5 分
工具类型
Agent工作流
Forks
15

📖 中文文档

以下内容由 AI Skill Hub 根据项目信息自动整理,如需查看完整原始文档请访问底部「原始来源」。

AI工作站 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。

📌 核心特色
  • 可视化 Agent 工作流编排,无需编写复杂代码
  • 支持多步骤自动化任务链,实现全流程无人值守
  • 与外部 API、数据库和第三方服务无缝集成
  • 内置错误处理与自动重试机制,保障稳定运行
  • 提供可复用的自动化模板,快速在同类场景部署
🎯 主要使用场景
  • 自动化日常重复性工作,将精力集中于创造性任务
  • 构建数据采集 → 处理 → 输出的完整自动化管线
  • 实现跨平台、跨系统的数据流转和业务协同
以下安装命令基于项目开发语言和类型自动生成,实际以官方 README 为准。
安装命令
# 方式一:pip 安装(推荐)
pip install guaardvark

# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install guaardvark

# 方式三:从源码安装(获取最新功能)
git clone https://github.com/guaardvark/guaardvark
cd guaardvark
pip install -e .

# 验证安装
python -c "import guaardvark; print('安装成功')"
📋 安装步骤说明
  1. 访问 GitHub 仓库获取工作流文件
  2. 在对应平台(Dify / Flowise / Make 等)中找到「导入工作流」功能
  3. 上传工作流文件
  4. 按照提示配置必要的环境变量和 API Key
  5. 运行测试确认流程正常后投入使用
以下用法示例由 AI Skill Hub 整理,涵盖最常见的使用场景。
常用命令 / 代码示例
# 命令行使用
guaardvark --help

# 基本用法
guaardvark input_file -o output_file

# Python 代码中调用
import guaardvark

# 示例
result = guaardvark.process("input")
print(result)
以下配置示例基于典型使用场景生成,具体参数请参照官方文档调整。
配置示例
# guaardvark 配置文件示例(config.yml)
app:
  name: "guaardvark"
  debug: false
  log_level: "INFO"

# 运行时指定配置文件
guaardvark --config config.yml

# 或通过环境变量配置
export GUAARDVARK_API_KEY="your-key"
export GUAARDVARK_OUTPUT_DIR="./output"
📑 README 深度解析 真实文档 完整度 75/100 查看 GitHub 原文 →
以下内容由系统直接从 GitHub README 解析整理,保留代码块、表格与列表结构。

简介

<p align="center"> <img src="docs/screenshots/og-image.jpg" alt="Guaardvark — Secure Offline AI Platform" width="640"> </p>

What's included

A full creative-professional AI workstation, all running locally:

Generation - Video (Text-to-Video, Image-to-Video) — Wan 2.2, CogVideoX 2B/5B, SVD-XT. No workflow graph required: paste a list of prompts, pick a model and resolution, hit go. The queue handles the rest while you start the next batch. - Audio Studio — music generation (ACE-Step, full songs with vocals or instrumental), sound-effect lab (Stable Audio Open), neural voice (Chatterbox + Kokoro), and 6 Piper voice profiles out of the box. - Voice Cloning — gated behind an explicit consent prompt before any clone is created or used. - Image generation — Stable Diffusion via Diffusers with batch queue, face restoration, anatomy and detail controls. - Image + Video Upscaling — 4K and 8K via HAT-L, RealESRGAN family, NMKD-Superscale, Foolhardy Remacri. Two-pass mode for maximum quality. Frame-by-frame video processing. - Batch CSV Generator — generate unique web pages, post content, or structured data from a CSV using your indexed knowledge base as ground truth. Marketing copy, product pages, unique-content campaigns at scale. - File Generation — code, text, docs, images, video, audio in one queue.

Editing - Video Editor — Shotcut-lite timeline with three lanes (video / text / audio), drag-and-drop from the media library, real text overlay rendering via ffmpeg, visual trim sliders, keyboard shortcuts, one-step undo. - Video Text Overlay — standalone tool for the simpler one-off case.

Agents & Automation - Autonomous screen agents — agents see a real virtual desktop (Xvfb :99), move the mouse, click, type, navigate browsers, and verify their own actions. - AgentBrain — three-tier neural routing: Reflex (<100ms), Instinct (1–3s), Deliberation (5–30s). - Agent Training System — visual hand-eye-coordination teaching: bracket a session with Begin/End Lesson, walk the agent through a flow with thumbs-up pearls, the system distills a structured replayable lesson with parameterized steps. - Agent Memory + Learning — system-message persistent knowledge that survives reboots, recipe induction from successful tasks (Agent Workflow Memory pattern), vision-actionable knowledge with no cached pixel coordinates. - Agent Swarms — up to 20 parallel coding agents, each in an isolated git worktree on its own branch. Dependency-ordered merging. Flight Mode (fully offline). Backends: Claude Code, Cline/OpenClaw via local Ollama. - Agents · Agent Tools · Virtual Agent Screen — explorable surfaces for each capability, with a draggable VNC viewer that works on any page. - Voice Chat — Whisper.cpp transcribes, the agent thinks, Piper speaks. Toggle with /voice. - Outreach System — supervised AI for social-media engagement (Reddit, Discord, Twitter/X, Facebook) grounded in your indexed knowledge. Full detail below. - Self-Improvement — detects test failures, dispatches an agent to read the offending code and fix it, verifies, broadcasts to other instances. Optional Anthropic-API guardian review. - Auto Researcher — autonomous RAG-pipeline optimizer that experiments with parameters, keeps wins, reverts losses.

Workflow Surfaces - File Manager — drag from your real desktop into the in-app File Manager. Color-code files, copy & paste, drag-and-drop reorganize. Folder / List / Media views. Right-click menus (copy, paste, delete, recursive index). Files attach to clients, projects, websites, notes, or code repos. - Notes Manager · Media Manager · Project Management · Client Management · Websites Management — consistent grid+detail UI for the working surfaces a small business actually uses. Cross-linked: documents attach to projects attach to clients attach to websites. - Dashboard — live status grid: model health, GPU usage, RAG state, agent activity, plugin states. - Code Editor — Monaco-based IDE with right-click "explain", "fix", "generate" via the AI assistant. - Code Analyzer · Code Repos — repo-level understanding and per-repo indexing. - Task Scheduler — cron-style scheduling for any agent task or generation job. - Rules & Prompts — import/export rules and prompts as a portable bundle.

Integration - ComfyUI Backend — managed as a plugin, used as the execution layer for advanced video pipelines. - WordPress Connectivity — push generated content directly into a WordPress site via a companion plugin. Functional today; ships with security disclaimers and a finishing-pass on the roadmap before the plugin moves out of beta.

Platform - Plugin System — every heavy capability (ComfyUI, Vision Pipeline, Audio Foundry, Upscaling, Discord, Swarm) is a managed plugin with health monitoring, port-based orphan cleanup, and a System Resource Orchestrator that arbitrates VRAM between them so two big models don't fight for the GPU. - CPU Offload for models that don't fit in VRAM. - GPU + CPU Resource Monitor — live, always visible. - Interconnector / Cluster — install Guaardvark on multiple local machines, master/client architecture with approval workflows, automatic load balancing across the fleet, hardware profile auto-detection. - Model Management — download voice/video/image models from HuggingFace with progress tracking. Quick-switch between local Ollama models. Quick-switch embedding models grouped by parameter count. - Backup & Restore — granular or full system backup, schema-migration-aware restore, cross-version compatible. - Advanced Settings — debugging toggles, RAG knobs, cache controls, diagnostic tools, test runners, self-improvement controls — exposed in the UI, not hidden behind a "config files only" wall.

<p align="center"> <img src="docs/screenshots/guaardvark-demo.gif" alt="Guaardvark Demo" width="100%"> </p>

<p align="center"> <img src="docs/screenshots/swarm-demo.gif" alt="Agent Swarm — parallel Claude Code agents across isolated git worktrees" width="100%"> <br> <em>Agent Swarm — parse a plan, spawn parallel agents in isolated git worktrees, resolve the dependency DAG, merge back to main.</em> </p>

Full Feature Set

Dependency Reconciler

  • Branch-aware sync — on git checkout, inspects venv / requirements.txt / Alembic head / package.json and re-syncs only what differs between branches
  • Single-master migrationsschema_sync.py is the authoritative schema source; saves you from "I just switched branches and now nothing works"
  • TDD-driven — 87 tests cover branch switches, partial states, and rollback scenarios

Requirements

DependencyVersionNotes
Python3.12+Backend
Node.js20+Frontend build
PostgreSQL14+Auto-installed
Redis5.0+Auto-installed
OllamalatestLocal LLM inference
CUDA GPU8GB+ VRAM16GB recommended for video generation

Install via PyPI

pip install guaardvark

The CLI connects to a running Guaardvark instance or launches a lightweight embedded server automatically.

---

Quick Start

git clone https://github.com/guaardvark/guaardvark.git
cd guaardvark
./start.sh

First run handles everything: Python venv, Node dependencies, PostgreSQL, Redis, Ollama, Whisper.cpp, database migrations, frontend build, and all services. Requires your system password once for PostgreSQL setup.

ServiceURL
Web UIhttp://localhost:5173
APIhttp://localhost:5000
Health Checkhttp://localhost:5000/api/health
./start.sh                    # Full startup with health checks
./start.sh --fast             # Skip dependency checks
./start.sh --test             # Health diagnostics
./start.sh --plugins          # Start all enabled plugins
./stop.sh                     # Stop all services

GPU Memory Guide

FeatureMinimumRecommended
Chat + RAG4GB8GB
Image generation6GB12GB
Wan 2.2 video11GB16GB
CogVideoX-5B video16GB20GB
Upscaling0.5GB2–4GB

---

Screenshots

DashboardCode Editor
![Dashboard](docs/screenshots/dashboard-page.png)![Code Editor](docs/screenshots/code-editor-page.png)
Media LibraryVideo Generation
![Media](docs/screenshots/media-library-page.png)![Video Gen](docs/screenshots/video-generation-page.png)
PluginsSwarm Plan Editor
![Plugins](docs/screenshots/plugins-page.png)![Swarm](docs/screenshots/swarm-plan-editor.png)
Settings — RAGSettings — Memory
![Settings RAG](docs/screenshots/settings-page-rag.png)![Settings Memory](docs/screenshots/settings-page-memory.png)

---

Advanced Settings

  • Debugging toggles, RAG knobs, cache controls, diagnostic tools, test runners, self-improvement controls
  • Surfaced in the UI, not hidden behind a "config files only" wall
  • Sectioned by area (Chat, RAG, Memory, Voice, Agents, Plugins, etc.) for quick navigation

Film Crew — End-to-End Production Pipeline

Five specialized agents collaborate to turn a one-line idea into a finished video. Built on the Swarm Orchestrator, so every role runs in parallel where possible and merges back deterministically.

RoleWhat It Does
**Screenwriter**Generates the script + scene breakdown from a logline
**Casting**Assigns characters to LoRAs (via the LoRA Trainer plugin) or stock characters
**Cinematographer**Produces a shot list with camera moves, framing, and lens choices
**Storyboard**Generates keyframe images for every shot via the image pipeline
**Editor**Assembles the generated clips into a finished video via the Video Editor

The LoRA Trainer plugin ships alongside — train character/environment/prop LoRAs from reference images on your local GPU (bf16, ~46 MB per LoRA) and route them automatically to the Casting agent.

Video Generation Pipeline

State-of-the-art video generation running entirely on your GPU. No cloud APIs, no per-minute billing, no content restrictions.

Video GenerationPlugin System
![Video Gen](docs/screenshots/video-generation-page.png)![Plugins](docs/screenshots/plugins-page.png)
ModelTypeMax DurationNative ResolutionVRAM
**Wan 2.2 (14B MoE)**Text-to-Video5s (81 frames @ 16fps)832x48011GB
**CogVideoX-5B**Text-to-Video6s (49 frames @ 8fps)720x48016GB
**CogVideoX-2B**Text-to-Video6s (49 frames @ 8fps)720x48012GB
**CogVideoX-5B I2V**Image-to-Video6s (49 frames @ 8fps)720x48016GB
**SVD XT**Text-to-Video3.5s (25 frames @ 7fps)512x512<8GB
  • Resolution options — 512px, 576px, 720px, 1280px, 1920px (1080p), and custom dimensions (multiples of 8)
  • Quality tiers — Fast (10 steps), Standard (30), High (40), Maximum (50)
  • Frame interpolation — 1x raw, 2x doubled FPS, 2x + upscale for cinema-quality output
  • Prompt enhancement — Cinematic, Realistic, Artistic, Anime, or raw
  • Low VRAM mode — automatically reduces resolution, frames, and inference steps for 8–12GB GPUs
  • Batch processing — queue multiple videos from a prompt list, processed by Celery workers
  • ComfyUI integration — one-click launch to the node editor for custom workflows

Plugin System

  • Managed plugins with health monitoring, port-based orphan cleanup, and auto-restore on restart
  • Manifest vs. runtime-state separationplugin.json is a static manifest (same bytes on every machine); live state (enabled, auto_start, config) lives in data/plugin_state.json (gitignored). Toggling from the UI writes only to runtime state — the manifest never mutates
  • Available plugins: Ollama, ComfyUI, Audio Foundry, Vision Pipeline, Upscaling, Swarm Orchestrator, LoRA Trainer, Discord Bot, GPU Embedding, Training
  • System Resource Orchestrator arbitrates VRAM between plugins so they don't trample each other
  • CPU Offload for models that don't fit in VRAM
  • Live GPU + CPU resource monitor, persistent across the UI
  • Model download management from HuggingFace with progress tracking — voice, video, image models

Vision Pipeline

  • Real-time frame analysis via Ollama vision models with adaptive FPS throttling
  • Two-layer change detection — perceptual hash + semantic analysis
  • Local camera capture with device enumeration and stream management
  • Context buffer with sliding window and compression
🇨🇳 中文文档镜像 AI 翻译 2026-06-09
英文原文章节由系统翻译为中文摘要,便于快速理解。完整原文见上方 "📑 README 深度解析"。
📌 简介

Guaardvark 是一个安全的离线 AI 平台,提供了一个全面的创作和专业的 AI 工作站,所有功能都运行在本地。

⚡ 功能介绍

Guaardvark 包括以下功能:视频(文本到视频、图像到视频)生成、音频工作室(音乐生成、歌曲生成等)等。

📋 环境依赖

Guaardvark 的环境依赖包括 Python 3.12+、Node.js 20+、PostgreSQL 14+、Redis 5.0+、Ollama 等。

🛠 安装步骤(Docker/pip/源码)

可以通过 pip 安装 Guaardvark,命令为 `pip install guaardvark`。

🚀 使用教程

使用 Guaardvark 的步骤包括克隆仓库、切换分支、运行 `start.sh` 脚本等。

⚙️ 配置说明(含 MCP / env)

Guaardvark 提供了多种配置选项,包括调试开关、RAG 控制器、缓存控制器等。

🔄 工作流/模块

Guaardvark 提供了一个完整的工作流和模块系统,包括 Film Crew(电影制作流程)、视频生成管道等。

🎯 aiskill88 AI 点评 A 级 2026-05-29

高质量AI工作流项目,具有较强的实用价值

📚 实用指南(长尾问题)
适合谁
  • 需要让 Claude / Cursor 操作本地工具的 AI 工程师
  • 构建多智能体协作系统的 Agent 开发者
  • 构建企业知识库 / RAG 检索应用的团队
  • 跨境业务、多语言内容运营团队
  • 做语音类 AI 产品的开发者
最佳实践
  • 配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC,避免暴露公网
  • 本地部署优先选 GGUF 量化模型,节省显存并保持响应速度
  • 分块大小建议 256-512 tokens,向量库优选 pgvector 或 Qdrant
  • Agent 任务先做 dry-run 验证工具调用链,再开启自主执行
常见错误
  • API key 直接提交到 git 仓库(请用 .env 并加入 .gitignore)
  • MCP 配置路径拼错或权限不足,重启 Claude Desktop 才生效
  • embedding 模型与查询模型不一致导致检索失效
  • 显存不足直接 OOM — 优先降低 context 或换更小的量化模型
  • Python 依赖冲突:建议用 venv / uv 隔离环境
部署方案
  • CLI:直接 npm install -g / pip install,命令行调用
  • 本地部署:CPU 8GB 起,GPU 推荐 16GB+ 显存
  • 云端托管:可放在 Vercel / Railway / Fly.io 等 PaaS 平台
相关搜索
guaardvark 中文教程guaardvark 安装报错怎么办guaardvark MCP 配置guaardvark Agent 工作流guaardvark 与同类工具对比guaardvark 最佳实践guaardvark 适合谁用

⚡ 核心功能

👥 适合谁
  • 需要让 Claude / Cursor 操作本地工具的 AI 工程师
  • 构建多智能体协作系统的 Agent 开发者
  • 构建企业知识库 / RAG 检索应用的团队
  • 跨境业务、多语言内容运营团队
⭐ 最佳实践
  • 配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC,避免暴露公网
  • 本地部署优先选 GGUF 量化模型,节省显存并保持响应速度
  • 分块大小建议 256-512 tokens,向量库优选 pgvector 或 Qdrant
  • Agent 任务先做 dry-run 验证工具调用链,再开启自主执行
⚠️ 常见错误
  • API key 直接提交到 git 仓库(请用 .env 并加入 .gitignore)
  • MCP 配置路径拼错或权限不足,重启 Claude Desktop 才生效
  • embedding 模型与查询模型不一致导致检索失效
  • 显存不足直接 OOM — 优先降低 context 或换更小的量化模型

👥 适合人群

自动化工程师和运维人员项目经理和业务分析师希望减少重复性工作的专业人士数字化转型团队

🎯 使用场景

  • 自动化日常重复性工作,将精力集中于创造性任务
  • 构建数据采集 → 处理 → 输出的完整自动化管线
  • 实现跨平台、跨系统的数据流转和业务协同

⚖️ 优点与不足

✅ 优点
  • +MIT 协议,可免费商用
  • +大幅减少重复性人工操作
  • +可视化流程,清晰直观
  • +可扩展性强,支持复杂场景
⚠️ 不足
  • 初始配置和调试需投入一定时间
  • 强依赖外部服务的稳定性
  • 复杂场景需具备一定技术基础
⚠️ 使用须知

AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。

建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。

📄 License 说明

✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。

🔗 相关工具推荐

📰 相关 AI 新闻
🍿 AI 圈相关吃瓜
🗺️ 相关解决方案
🧩 你可能还需要
基于当前 Skill 的能力图谱,自动补全的工具组合

❓ 常见问题 FAQ

guaardvark 是一款Python开发的AI辅助工具。开源AI工作流:The self-hosted AI workstation. Autonomous screen agents, 3-tier neural routing,。⭐24 · Python 主要应用场景包括:自动化AI任务处理。
💡 AI Skill Hub 点评

AI Skill Hub 点评:AI工作站 的核心功能完整,质量良好。对于自动化工程师和运维人员来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。

⬇️ 获取与下载
⬇ 下载源码 ZIP

✅ MIT 协议 · 可免费商用 · 直接从 aiskill88 服务器下载,无需跳转 GitHub

📚 深入学习 AI工作站
查看分步骤安装教程和完整使用指南,快速上手这款工具
🌐 原始信息
原始名称 guaardvark
原始描述 开源AI工作流:The self-hosted AI workstation. Autonomous screen agents, 3-tier neural routing,。⭐24 · Python
Topics aiai-agentsflaskpython
GitHub https://github.com/guaardvark/guaardvark
License MIT
语言 Python
🔗 原始来源
🐙 GitHub 仓库  https://github.com/guaardvark/guaardvark 🌐 官方网站  https://guaardvark.com

收录时间:2026-05-29 · 更新时间:2026-05-30 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。