📄 工具详情 ⚙️ 安装教程 📚 使用教程

能力标签

🤖 Agent 🔄 工作流 🐳 Docker 💻 CLI 🔗 REST API 🧬 Embedding 📚 RAG 🧠 Claude ✨ GPT 🖥 本地 LLM

⚙️

Agent工作流

爬虫lama

Q: Crawllama 如何安装和开始使用？

访问 Crawllama 的 GitHub 仓库或官方网站，按照 README 文档中的步骤安装依赖并运行。通常需要 Python 3.8+ 或 Node.js 16+ 基础环境。

Q: Crawllama 是否免费？许可证是什么？

Crawllama 完全免费，采用 NOASSERTION 许可证开源发布，任何人都可以免费使用、修改和分发。

Q: Crawllama 适合哪些用户使用？

Crawllama 主要面向有一定技术基础的用户，包括开发者、数据分析师、AI 工程师等专业人士。

Q: Crawllama 的社区活跃度和项目维护状况如何？

Crawllama 在 GitHub 上已获得 63 个 Star，处于积极发展阶段，社区在持续扩大。

基于 Python · 无代码搭建完整 AI 自动化流程

英文名：Crawllama

⭐ 63 Stars 🍴 4 Forks 💻 Python 📄 NOASSERTION 🏷 AI 7.5分

7.5AI 综合评分

自动化爬虫FastAPI

⚙️ 配置说明

✦ AI Skill Hub 推荐

AI Skill Hub 推荐使用：爬虫lama 是一款优质的Agent工作流。AI 综合评分 7.5 分，在同类工具中表现稳健。如果你正在寻找可靠的Agent工作流解决方案，这是一个值得深入了解的选择。

📚 深度解析

爬虫lama 是一套完整的 AI Agent 自动化工作流方案。随着 AI 能力的不断提升，基于 Agent 的自动化工作流正在成为提升个人和团队效率的核心方式。区别于传统的 RPA 自动化（模拟鼠标键盘操作），AI Agent 工作流通过理解任务意图、动态规划执行路径，能够处理更复杂的非结构化任务。

爬虫lama 工作流的设计遵循"最小配置，最大复用"原则：核心逻辑已经封装好，用户只需配置自己的 API Key 和业务参数即可快速上手。工作流内置错误处理和重试机制，在网络波动或 API 限速等情况下仍能稳定运行，适合作为生产环境的自动化基础设施。

在实际部署时，建议先在测试环境中运行 3-5 次，验证各个环节的输出结果符合预期，再部署到生产环境。AI Skill Hub 评分 7.5 分，是同类 Agent 工作流中的精选推荐。

📋 工具概览

爬虫lama 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排，将复杂的多步骤任务拆解为清晰的自动化流程，实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成，适合构建数据处理管线、业务自动化和 AI 辅助决策系统。

GitHub Stars

⭐ 63

开发语言

Python

支持平台

Windows / macOS / Linux

维护状态

轻量级项目，按需更新

开源协议

NOASSERTION

AI 综合评分

7.5 分

工具类型

Agent工作流

Forks

📖 中文文档

以下内容由 AI Skill Hub 根据项目信息自动整理，如需查看完整原始文档请访问底部「原始来源」。

📌 核心特色

可视化 Agent 工作流编排，无需编写复杂代码
支持多步骤自动化任务链，实现全流程无人值守
与外部 API、数据库和第三方服务无缝集成
内置错误处理与自动重试机制，保障稳定运行
提供可复用的自动化模板，快速在同类场景部署

🎯 主要使用场景

自动化日常重复性工作，将精力集中于创造性任务
构建数据采集 → 处理 → 输出的完整自动化管线
实现跨平台、跨系统的数据流转和业务协同

以下安装命令基于项目开发语言和类型自动生成，实际以官方 README 为准。

安装命令

# 方式一：pip 安装（推荐）
pip install crawllama

# 方式二：虚拟环境安装（推荐生产环境）
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install crawllama

# 方式三：从源码安装（获取最新功能）
git clone https://github.com/arn-c0de/Crawllama
cd Crawllama
pip install -e .

# 验证安装
python -c "import crawllama; print('安装成功')"

📋 安装步骤说明

访问 GitHub 仓库获取工作流文件
在对应平台（Dify / Flowise / Make 等）中找到「导入工作流」功能
上传工作流文件
按照提示配置必要的环境变量和 API Key
运行测试确认流程正常后投入使用

以下用法示例由 AI Skill Hub 整理，涵盖最常见的使用场景。

常用命令 / 代码示例

# 命令行使用
crawllama --help

# 基本用法
crawllama input_file -o output_file

# Python 代码中调用
import crawllama

# 示例
result = crawllama.process("input")
print(result)

以下配置示例基于典型使用场景生成，具体参数请参照官方文档调整。

配置示例

# crawllama 配置文件示例（config.yml）
app:
  name: "crawllama"
  debug: false
  log_level: "INFO"

# 运行时指定配置文件
crawllama --config config.yml

# 或通过环境变量配置
export CRAWLLAMA_API_KEY="your-key"
export CRAWLLAMA_OUTPUT_DIR="./output"

📑 README 深度解析真实文档完整度 95/100 查看 GitHub 原文 →

以下内容由系统直接从 GitHub README 解析整理，保留代码块、表格与列表结构。

简介

CrawlLama

Project Website

AI Research Agent with OSINT and Multi-Hop Reasoning Current Version: 1.4.10 (Improvements & Fixes) New in 1.4.8: Company Intelligence Developer Documentation

Overview

A locally-capable AI research agent with advanced intelligence features (local processing requires a local LLM backend such as Ollama; cloud-based LLM APIs like Claude or OpenAI are not local): - OSINT module: email, phone, and IP intelligence; social media analysis; advanced search operators - Multi-hop reasoning using LangGraph for complex queries - Adaptive agent selection based on query complexity (low/mid/high) - REST API with FastAPI for integration - Plugin system for extensibility - Performance optimizations with large context support and asynchronous execution

Features

Core features

Adaptive agent hopping system – Automatic agent selection based on query complexity (low/mid/high), confidence-based escalation, and resource-aware adaptation (NEW v1.4.4)
UI settings for adaptive report – Toggle the Adaptive Intelligence Report directly from the interactive settings menu (NEW v1.4.4)
Multi-hop reasoning – LangGraph-based agent with a multi-step workflow (Router → Search → Analyze → Follow-Up → Synthesize → Critique)
Restart command – Restart the agent without exiting the program
Parallelization – Multi-aspect searches using thread pools for improved performance
Performance optimizations – 16k context support for RTX 3080; async and parallel processing
Multi-source web search – DuckDuckGo, Brave Search, Serper API with fallback
Wikipedia integration – Dedicated Wikipedia search (German/English)
Advanced RAG system – Batch processing, multi-query and hybrid search (RAG Analysis)
Intelligent caching – TTL-based with hash keys, LRU eviction, and a configurable max size (500MB)
Tool orchestration – Automatic tool selection via LLM
Interactive settings menu – Live configuration of LLM, search, RAG and OSINT
Context usage tracker – Real-time token usage monitoring using tiktoken
Health monitoring dashboard – Interactive system monitoring with a rich UI
Lazy-loading – On-demand loading for tools and plugins
Async operations – Parallel HTTP requests with aiohttp
Resource monitoring – RAM usage, performance tracking, and automated garbage collection
FastAPI REST API – 8+ endpoints with auto-documentation (/query, /plugins, /stats, /health) (see app.py)
Plugin system – Dynamic loading and unloading of plugins
Enhanced CLI – Rich formatting and Markdown output
Setup scripts – setup.bat, setup.sh with auto-configuration
Optional cloud LLM support (note: using cloud APIs such as Claude or OpenAI means data is sent to external servers and is not processed locally)

OSINT features

Advanced search operators – site:, inurl:, intext:, filetype:, email:, phone:, ip:
Email intelligence – Validation, MX records, disposable detection, and variations
Phone intelligence – Validation, carrier lookup, country detection, and formatting
Persistent memory store – Survives clear; stores emails, phones, IPs, usernames, domains and notes
Memory store CRUD – Full CRUD functionality with forget command
Batch processing – Analyze multiple emails or phones simultaneously with summary statistics
IP intelligence – IPv4/IPv6 analysis, geolocation, ISP info, security reputation and VPN detection
Social intelligence – Supports 12 platforms (GitHub, LinkedIn, Twitter, Instagram, Facebook, YouTube, Reddit, Pinterest, TikTok, Snapchat, Discord, Steam)
AI query enhancement – Query variations, operator suggestions, entity detection and auto-type detection
Compliance module – Rate limiting, terms of use, audit logging and robots.txt compliance
Privacy protection – Ethical scraping, usage tracking; no API keys required
Safesearch quality filter – Configurable result quality (off/moderate/strict)

3. Install dependencies (takes 5-10 min)

pip install -r requirements.txt

3. Install dependencies (takes 5-10 min)

pip install -r requirements.txt

3. Dependencies

pip install -r requirements.txt

Reinstall dependencies

pip install -r requirements.txt

Installation

Important — Keep dependencies up to date: If you are using a pre-built / downloaded package release (zip/tar), always install dependencies from the latest requirements.txt in this repository rather than any bundled copy. Package versions are updated regularly to address security vulnerabilities and compatibility issues.
> pip install -r requirements.txt
> 
Always fetch the current file from the repository before installing to ensure you have the latest, secure package versions.

Windows: 1. Download Crawllama 2. Extract to any folder (e.g., C:\Crawllama) 3. Install Ollama from ollama.ai/download 4. Start Ollama and load model:

 ollama serve
 ollama pull qwen3:4b

5. In the Crawllama folder:

 setup.bat
 run.bat

Linux/macOS: 1. Download and extract:

 wget https://github.com/arn-c0de/Crawllama/archive/refs/tags/v1.4.7-preview.zip
 unzip v1.4.7-preview.zip
 cd Crawllama-v1.4.7-preview

2. Install Ollama:

 curl -fsSL https://ollama.ai/install.sh | sh
 ollama serve &
 ollama pull qwen3:4b

3. Setup and start:

 chmod +x setup.sh run.sh
 ./setup.sh
 ./run.sh

---

Option 1: Setup Scripts (Recommended for Git Installation)

Windows:

setup.bat

Linux/macOS:

chmod +x setup.sh
./setup.sh

Note: After the initial setup, you must select at least one LLM model during setup. If a model is already installed, you can skip this step—otherwise, selection is required to avoid errors in the test program.

The setup script: - Checks Python version (3.10+) - Creates virtual environment - Lets you select features and LLM models to install (core is always installed) - Installs all selected dependencies - Creates necessary directories - Copies .env.example to .env - Checks Ollama status

Note for initial installation:

When running pip install -r requirements.txt for the first time within the newly created virtual environment, installing all dependencies—especially packages like torch, sentence-transformers, and scientific libraries—may take 5–10 minutes (or longer, depending on connection and hardware). Please wait until the process completes; afterward, the virtual environment is ready for use.

Note on disk space: After installation (including venv), the project typically requires about 1.2–1.5 GB of free disk space (v1.4: ~1.23 GB). This value may vary significantly depending on the operating system, Python packages (e.g., larger PyTorch/CUDA wheels), and additional models. Plan for ample additional space if storage is limited.

Model download sizes (approximate):

qwen3:4b — ~2–4 GB (depending on format/quantization)
qwen3:8b — ~8–12 GB
deepseek-r1:8b — ~6–10 GB
llama3:7b — ~6–9 GB
mistral:7b — ~4–8 GB
phi3:14b — ~12–20+ GB

Note: Model sizes vary significantly depending on the provider, format (FP16, INT8 quantization, etc.), and additional assets. Quantized models (e.g., INT8) can significantly reduce size, while FP32/FP16 or models with additional tokenizer/vocab files require more space. Plan for sufficient additional storage if using larger models or multiple models simultaneously.

Option 2: Manual Installation

Prerequisites: - Python 3.10+ (python.org) - Git (git-scm.com) - Ollama (ollama.ai/download)

Windows - Step by Step:

```cmd

6. Install and start Ollama

curl -fsSL https://ollama.ai/install.sh | sh ollama serve &

Option 3: Git Clone (Quick Installation)

```bash

Ollama Setup

```bash

Install Ollama

curl -fsSL https://ollama.ai/install.sh | sh # Linux/macOS

Or with setup script

run.bat # Windows ./run.sh # Linux/macOS

╭──────────────────────────────────────────────────────────────╮ │ CrawlLama - Local Search and Response Agent │ │ Commands: │ │ clear - Reset session (history + cache) │ │ clear-cache - Clear cache only │ │ save - Manually save session │ │ load - Reload session │ │ stats - Display statistics │ │ status - Show context usage │ │ settings - Show/edit settings │ │ restart - Restart agent (reload config) │ │ exit, quit - Exit │ ╰──────────────────────────────────────────────────────────────╯

What is Machine Learning?


**New Commands:**

- `status` - Shows token usage and available context capacity

status

Context Usage Tracker ┏━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━┓ ┃ Source ┃ Tokens ┃ Share ┃ ┡━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━┩ │ Conversation │ 850 │ 8.5% │ │ Search Results │ 320 │ 3.2% │ │ Total Used │ 1,170 │ 11.7% │ │ Available │ 8,830 │ 88.3% │ │ Maximum │ 10,000 │ 100% │ └───────────────────┴───────────┴───────────┘


- `settings` - Interactive configuration editor

settings

Displays all settings and allows: • Category selection (llm, search, rag, cache, osint, all) • Change LLM model (qwen3:8b, deepseek-r1:8b, etc.) • Adjust temperature (0.0-1.0) • Configure max tokens (now 16,000 for RTX 3080+) • Change search region (de-de, us-en, wt-wt) • Configure OSINT max results & rate limits • Enable/disable RAG • Enable/disable cache • Save changes directly to config.json • Auto-restart after saving (optional)


- `restart` - Restart agent

restart

• Reloads config.json • Fully reinitializes agent • Optional session preservation • No session interruption ```

Or re-run setup

./setup.sh # or setup.bat ```

Quickstart

Usage

Note: The first start may take significantly longer than subsequent starts! Initialization, dependency installation, and model downloads may take several minutes, depending on hardware and internet connection. After the first successful start, all subsequent starts are significantly faster.

Quickstart

1. Start API Server

run_api.bat

2. Open API Documentation - Interactive Docs: http://localhost:8000/docs - ReDoc: http://localhost:8000/redoc

3. Send Query

curl -X POST http://localhost:8000/query \
 -H "X-API-Key: your-key" \
 -H "Content-Type: application/json" \
 -d '{"query": "What is Python?", "use_tools": false}'

User Guides

Installation Guide - Detailed installation
LangGraph Guide - Multi-hop reasoning
Plugin Tutorial - Plugin development
Health Monitoring - System monitoring

2. Create virtual environment

python -m venv venv venv\Scripts\activate

5. Configuration

copy .env.example .env notepad .env # Optional: Add API keys

2. Create virtual environment

python3 -m venv venv source venv/bin/activate

5. Configuration

cp .env.example .env nano .env # Optional: Add API keys

2. Virtual Environment

python -m venv venv source venv/bin/activate # Linux/macOS venv\Scripts\activate # Windows

5. Config

cp .env.example .env ```

1. Set API key in .env

CRAWLLAMA_API_KEY=your_secure_api_key_min_32_chars

3. Adjust rate limit (optional)

RATE_LIMIT=100

4. Configure CORS origins (optional)

ALLOWED_ORIGINS=http://localhost:3000,http://localhost:8080


**Usage with API Key:**

bash

CLI Commands and Options

Basic Options

Option	Description
`--interactive`	Interactive mode
`--debug`	Enable debug logging
`--no-web`	Offline mode (no web search)
`--model MODEL`	Choose Ollama model
`--stats`	Display system statistics
`--clear-cache`	Clear cache

Advanced Options (v1.1)

Option	Description
`--multihop`	Enable multi-hop reasoning
`--max-hops N`	Max reasoning steps (1-5)
`--api`	Start API server
`--plugins`	List available plugins
`--load-plugin NAME`	Load plugin
`--help-extended`	Show extended help
`--examples`	Show usage examples
`--setup-keys`	Securely set up API keys

Configuration

config.json

{
 "llm": {
 "base_url": "http://127.0.0.1:11434",
 "model": "qwen3:8b",
 "temperature": 0.7,
 "max_tokens": 10000,
 "stream": true
 },
 "search": {
 "provider": "duckduckgo",
 "max_results": 5,
 "timeout": 10
 },
 "rag": {
 "enabled": true,
 "batch_size": 100,
 "max_workers": 4
 },
 "cache": {
 "enabled": true,
 "ttl_hours": 24,
 "max_size_mb": 500,
 "clear_on_startup": false
 },
 "osint": {
 "max_results": 20,
 "email_search_limit": 50,
 "phone_search_limit": 50,
 "general_osint_limit": 100
 },
 "multihop": {
 "enabled": true,
 "max_hops": 3,
 "confidence_threshold": 0.7,
 "enable_critique": true
 },
 "plugins": {
 "example_plugin": {
 "enabled": true
 }
 },
 "security": {
 "rate_limit": 1.0,
 "max_context_length": 8000,
 "check_robots_txt": true
 }
}

Recommended max_tokens Settings:

GPU/Hardware	Recommended max_tokens	Model
RTX 3080+ (10GB+)	10,000 - 16,000	qwen3:8b, deepseek-r1:8b
RTX 3060/3070 (8GB)	6,000 - 8,000	qwen3:4b, llama3:7b
CPU Only	2,000 - 4,000	qwen3:4b

Tip: Use the status command to monitor your token usage in real-time!

.env (Optional)

```bash

API Keys (optional)

BRAVE_API_KEY=your_brave_api_key SERPER_API_KEY=your_serper_api_key

Proxy (optional)

HTTP_PROXY=http://proxy:port HTTPS_PROXY=https://proxy:port ```

Adjust in config.json

"security": { "rate_limit": 2.0 # 2 req/s } ```

Interactive CLI Interface

CrawlLama's adaptive intelligence system with automatic agent selection and interactive commands.

CLI Interface

1. CLI - Interactive Mode

```bash python main.py --interactive

4. CLI - Direct Queries

```bash

5. FastAPI Server

```bash

2. For local development (without API key)

CRAWLLAMA_DEV_MODE=true

With API key header

curl -X POST http://localhost:8000/query \ -H "Content-Type: application/json" \ -H "X-API-Key: your_api_key_here" \ -d '{"query": "test"}'

Or in dev mode (without API key)

export CRAWLLAMA_DEV_MODE=true python app.py


**Example Requests:**

bash

REST API

CrawlLama provides a complete REST API for integration into custom applications.

Start API Server

Windows:

run_api.bat

Linux/macOS:

./run_api.sh

Or manually:

uvicorn app:app --host 0.0.0.0 --port 8000

Key Endpoints

POST /query - Execute queries (with/without web search, multi-hop)
GET /health - Health check
GET /stats - System statistics
POST /memory/remember - Store data (OSINT)
GET /memory/recall/{category} - Retrieve data
GET /plugins - Manage plugins
POST /cache/clear - Clear cache

API Documentation

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

Web Scraping

Respects robots.txt
Rate limiting (1 req/s default)
Identifiable user agent
Users are responsible for compliance with local laws

API Keys

Brave Search API: brave.com/search/api
Serper API: serper.dev

API rate limits

```bash

List plugins

curl http://localhost:8000/plugins

Load plugin

curl -X POST http://localhost:8000/plugins/example_plugin/load ```

Plugin Development

Creating a Simple Plugin

```python

plugins/my_plugin.py

from core.plugin_manager import Plugin, PluginMetadata

class MyPlugin(Plugin): def get_metadata(self) -> PluginMetadata: return PluginMetadata( name="MyPlugin", version="1.0.0", description="My custom plugin", author="Your Name", dependencies=[] )

def get_tools(self): return [self.my_tool]

def my_tool(self, input: str) -> str: return f"Processed: {input}" ```

See: Plugin Tutorial for details

Alternative: deepseek-r1:8b, llama3:7b, mistral

```

Troubleshooting

🇨🇳 中文文档镜像 AI 翻译 2026-05-30

英文原文章节由系统翻译为中文摘要，便于快速理解。完整原文见上方 "📑 README 深度解析"。

📌 简介

CrawlLama 是一个具备本地处理能力的 AI 研究智能体。它集成了强大的 OSINT（开源情报）模块，支持对 Email、电话、IP 地址及社交媒体进行深度分析。通过 LangGraph 实现多跳推理（Multi-hop reasoning），能够处理复杂的查询任务。请注意，虽然支持通过 Claude 或 OpenAI 等云端 API 进行增强，但若要实现完全的本地化运行，需要配置如 Ollama 这样的本地 LLM 后端。

⚡ 功能介绍

项目核心功能包括：自适应智能体切换系统（Adaptive agent hopping），可根据查询复杂度自动选择合适的 Agent 并进行资源感知适配；基于 LangGraph 的多步工作流，支持复杂逻辑推理；强大的 OSINT 能力，支持使用 site:、inurl: 等高级搜索运算符进行情报搜集；此外，系统还具备持久化记忆存储功能，能够记录并关联 Email、域名及用户信息，确保研究过程的连续性。

📋 环境依赖

运行 CrawlLama 需要 Python 3.11+ 环境。在安装依赖阶段，请确保网络环境稳定，执行 `pip install -r requirements.txt` 可能需要 5-10 分钟。建议开发者在安装前检查 Python 版本，并确保系统环境符合项目对依赖包的安全与兼容性要求。

🛠 安装步骤（Docker/pip/源码）

推荐使用官方提供的 Setup Scripts 进行安装：Windows 用户运行 `setup.bat`，Linux/macOS 用户运行 `setup.sh`。脚本会自动检查 Python 版本、创建虚拟环境并引导模型选择。若选择手动安装，请确保已安装 Git、Python 3.10+ 以及 Ollama。特别提醒：若使用下载的压缩包，请务必使用仓库中最新的 `requirements.txt` 以修复潜在的安全漏洞。

🚀 使用教程

项目启动流程分为两步：首先通过 `run_api.bat` 启动 API Server；随后可以通过交互式文档（http://localhost:8000/docs）进行测试。请注意，首次启动由于涉及初始化、依赖安装及模型下载，耗时会显著增加，但后续启动将非常迅速。你可以通过 `curl` 命令向本地 API 发送 POST 请求来执行查询任务。

⚙️ 配置说明（含 MCP / env）

项目通过 `.env` 文件进行配置管理。在完成虚拟环境创建并激活后，请将 `.env.example` 复制并重命名为 `.env`。你可以通过编辑器手动添加必要的 API keys（如 Claude 或 OpenAI 的密钥）以增强模型能力。对于本地化部署，请确保 Ollama 服务已正确配置并处于运行状态。

🔌 API 说明

CrawlLama 提供功能完备的 API 接口与交互式 CLI 界面。开发者可以通过 `python main.py --interactive` 进入交互模式进行智能对话，也可以通过标准的 RESTful API 进行集成。API 支持插件化扩展，你可以通过查询 `/plugins` 接口获取当前加载的插件列表，并使用 POST 请求动态加载新的插件模块。

🔄 工作流/模块

CrawlLama 的工作流基于高度智能的路由机制。系统会根据用户输入的查询复杂度（低/中/高）自动进行 Agent 路由与任务分发。通过 LangGraph 构建的推理链条，智能体能够执行多步骤的搜索与验证任务。同时，系统支持插件化开发模式，允许开发者通过 API 动态加载自定义插件来扩展其情报搜集与处理能力。

❓ FAQ 摘要

在使用过程中，如遇到启动缓慢或报错，请优先检查网络连接及模型下载状态。若遇到依赖冲突，建议重新创建虚拟环境并使用最新的 `requirements.txt` 进行安装。对于模型选择问题，请确保在初始化设置阶段已正确配置 LLM 模型，以避免测试程序��行异常。

📚 实用指南（长尾问题）

适合谁

构建多智能体协作系统的 Agent 开发者
构建企业知识库 / RAG 检索应用的团队

最佳实践

生产部署优先使用 Docker Compose 隔离依赖，并挂载 volume 持久化数据
本地部署优先选 GGUF 量化模型，节省显存并保持响应速度
分块大小建议 256-512 tokens，向量库优选 pgvector 或 Qdrant
Agent 任务先做 dry-run 验证工具调用链，再开启自主执行

常见错误

API key 直接提交到 git 仓库（请用 .env 并加入 .gitignore）
容器内无法访问宿主机 localhost — 使用 host.docker.internal
embedding 模型与查询模型不一致导致检索失效
显存不足直接 OOM — 优先降低 context 或换更小的量化模型
Python 依赖冲突：建议用 venv / uv 隔离环境

部署方案

Docker：Crawllama 提供官方镜像，docker compose up 一键启动
CLI：直接 npm install -g / pip install，命令行调用
本地部署：CPU 8GB 起，GPU 推荐 16GB+ 显存
云端托管：可放在 Vercel / Railway / Fly.io 等 PaaS 平台

⚡ 核心功能

可视化 Agent 工作流编排，无需编写复杂代码
支持多步骤自动化任务链，实现全流程无人值守
与外部 API、数据库和第三方服务无缝集成
内置错误处理与自动重试机制，保障稳定运行
提供可复用的自动化模板，快速在同类场景部署

👥 适合谁

构建多智能体协作系统的 Agent 开发者
构建企业知识库 / RAG 检索应用的团队

⭐ 最佳实践

生产部署优先使用 Docker Compose 隔离依赖，并挂载 volume 持久化数据
本地部署优先选 GGUF 量化模型，节省显存并保持响应速度
分块大小建议 256-512 tokens，向量库优选 pgvector 或 Qdrant
Agent 任务先做 dry-run 验证工具调用链，再开启自主执行

⚠️ 常见错误

API key 直接提交到 git 仓库（请用 .env 并加入 .gitignore）
容器内无法访问宿主机 localhost — 使用 host.docker.internal
embedding 模型与查询模型不一致导致检索失效
显存不足直接 OOM — 优先降低 context 或换更小的量化模型

👥 适合人群

自动化工程师和运维人员项目经理和业务分析师希望减少重复性工作的专业人士数字化转型团队

🎯 使用场景

自动化日常重复性工作，将精力集中于创造性任务
构建数据采集 → 处理 → 输出的完整自动化管线
实现跨平台、跨系统的数据流转和业务协同

⚖️ 优点与不足

✅ 优点

+大幅减少重复性人工操作
+可视化流程，清晰直观
+可扩展性强，支持复杂场景

⚠️ 不足

−初始配置和调试需投入一定时间
−强依赖外部服务的稳定性
−复杂场景需具备一定技术基础

⚠️ 使用须知

该工具使用 NOASSERTION 协议，商用场景请仔细阅读协议条款，必要时咨询法律意见。

AI Skill Hub 为第三方内容聚合平台，本页面信息基于公开数据整理，不对工具功能和质量作任何法律背书。

建议在沙箱或测试环境中充分验证后，再部署至生产环境，并做好必要的安全评估。

📄 License 说明

📄 NOASSERTION — 请查阅原始协议条款了解具体使用限制。

🔗 相关工具推荐

📚 相关教程推荐

SEO 自动化工具链搭建：用 AI 提升搜索流量的系统方法

帮助中心 · AI Skill Hub

电商运营 AI 自动化方案：从选品到客服的全链路 AI 工具应用

帮助中心 · AI Skill Hub

n8n vs Make vs Zapier 2025：哪款工作流自动化工具最适合 AI 时代

帮助中心 · AI Skill Hub

📰 相关 AI 新闻

AI 前沿资讯：Hiring Senior Founding AI/Back…

AI 前沿资讯：Does your CEO have AI psychosi…

AI 资讯 · 知识关联

🍿 AI 圈相关吃瓜

AutoGPT 自主完成了任务：把我的文件夹全部重命名了

AI 圈观察

Karpathy 的编程陷阱清单，我用来当 Claude 的「紧箍咒」

AI 圈观察

设计 Agent 说能生成 71 个系统，我信了

🗺️ 相关解决方案

ai-workflow-templates

🧩 你可能还需要

基于当前 Skill 的能力图谱，自动补全的工具组合

❓ 常见问题 FAQ

Crawllama 是什么工具？−

Crawllama 是一款Python开发的AI辅助工具。开源AI工作流：CrawlLama 🦙 is an local AI agent that answers questions via Ollama and integra。⭐63 · Python 主要应用场景包括：本地AI问答。

Crawllama 如何安装和开始使用？+

Crawllama 是否免费？许可证是什么？+

Crawllama 适合哪些用户使用？+

Crawllama 的社区活跃度和项目维护状况如何？+

什么是 Agent 工作流？和普通自动化有什么区别？+

导入工作流后，我需要修改哪些配置？+

工作流运行失败了，如何排查问题？+

💡 AI Skill Hub 点评

总体来看，爬虫lama 是一款质量良好的Agent工作流，在同类工具中具备一定竞争力。AI Skill Hub 将持续追踪其更新动态，建议收藏备用，结合自身场景选择合适时机引入使用。

⬇️ 获取与下载

📚 深入学习爬虫lama

查看分步骤安装教程和完整使用指南，快速上手这款工具

⚙️ 安装教程 📚 使用教程

🌐 原始信息

原始名称	`Crawllama`
原始描述	开源AI工作流：CrawlLama 🦙 is an local AI agent that answers questions via Ollama and integra。⭐63 · Python
Topics	`自动化爬虫FastAPI`
GitHub	https://github.com/arn-c0de/Crawllama
License	NOASSERTION
语言	Python

🔗 原始来源

🐙 GitHub 仓库 https://github.com/arn-c0de/Crawllama

收录时间：2026-05-30 · 更新时间：2026-05-30 · License：NOASSERTION · AI Skill Hub 不对第三方内容的准确性作法律背书。

爬虫lama

📚 深度解析

📋 工具概览

📖 中文文档

简介

CrawlLama

Overview

Features

Core features

OSINT features

3. Install dependencies (takes 5-10 min)

3. Install dependencies (takes 5-10 min)

3. Dependencies

Reinstall dependencies

Installation

Option 1: Setup Scripts (Recommended for Git Installation)

Option 2: Manual Installation

6. Install and start Ollama

Option 3: Git Clone (Quick Installation)

Ollama Setup

Install Ollama

Or with setup script

Or re-run setup

Quickstart

Usage

Quickstart

User Guides

2. Create virtual environment

5. Configuration

2. Create virtual environment

5. Configuration

2. Virtual Environment

5. Config

1. Set API key in .env

3. Adjust rate limit (optional)

4. Configure CORS origins (optional)

CLI Commands and Options

Basic Options

Advanced Options (v1.1)

Configuration

config.json

.env (Optional)

API Keys (optional)

Proxy (optional)

Adjust in config.json

Interactive CLI Interface

1. CLI - Interactive Mode

4. CLI - Direct Queries

5. FastAPI Server

2. For local development (without API key)

With API key header

Or in dev mode (without API key)

REST API

Start API Server

Key Endpoints

API Documentation

Web Scraping

API Keys

API rate limits

List plugins

Load plugin

Plugin Development

Creating a Simple Plugin

plugins/my_plugin.py

Alternative: deepseek-r1:8b, llama3:7b, mistral

Troubleshooting

⚡ 核心功能

👥 适合人群

🎯 使用场景

⚖️ 优点与不足

🔗 相关工具推荐

❓ 常见问题 FAQ

🤖 交给 Agent 安装 · 爬虫lama