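SearxNcrawl is one of the MCP tools featured by AI Skill Hub in this issue. Its overall score is 7.5, which indicates good overall quality. We recommend adding it to your AI toolkit to improve productivity.
SearxNcrawl is an AI tool extension that follows the MCP (Model Context Protocol) standard. Through MCP, AI clients such as Claude and Cursor can directly access and operate external tools, data sources, and services, which extends what the AI can do. File operations, database queries, and API calls can all be triggered in natural language from within an AI conversation.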
# Option 1: one-step install via the Claude Code CLI
claude skill install https://github.com/DasDigitaleMomentum/searxNcrawl
# Option 2: configure claude_desktop_config.json manually
{
  "mcpServers": {
    "searxncrawl": {
      "command": "npx",
      "args": ["-y", "searxncrawl"]
    }
  }
}
# Config file locations
# macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
# Windows: %APPDATA%/Claude/claude_desktop_config.json
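As a rough sketch of the manual option above, the following Python helper merges one server entry into a config dictionary of the same shape. The function name `add_mcp_server` is hypothetical, and the `npx` command is copied from the snippet above rather than from the searxNcrawl documentation.

```python
import copy
import json


def add_mcp_server(config, name, command, args, env=None):
    """Return a copy of `config` with one entry added under "mcpServers"."""
    updated = copy.deepcopy(config)
    entry = {"command": command, "args": list(args)}
    if env:
        entry["env"] = dict(env)
    # Create the "mcpServers" section if the config does not have one yet.
    updated.setdefault("mcpServers", {})[name] = entry
    return updated


# Hypothetical existing config with one unrelated server already registered.
existing = {"mcpServers": {"other": {"command": "node", "args": ["server.js"]}}}
merged = add_mcp_server(existing, "searxncrawl", "npx", ["-y", "searxncrawl"])
print(json.dumps(merged, indent=2))
```

Working on a copy keeps the original dictionary untouched, so a failed write cannot leave a half-edited config in memory.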
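# After installation, use it directly in a Claude conversation
# Example:
#   User: Please use SearxNcrawl to carry out the following task...
#   Claude: [calls the SearxNcrawl MCP tool automatically to handle the request]
# List the available tools
# In Claude, type: "List all available MCP tools"
// claude_desktop_config.json configuration example
{
  "mcpServers": {
    "searxncrawl": {
      "command": "npx",
      "args": ["-y", "searxncrawl"],
      "env": {
        // "API_KEY": "your-api-key-here"
      }
    }
  }
}
// Restart Claude Desktop after saving for the change to take effect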
searxNcrawl is a minimal MCP server and CLI toolkit for search and crawling, built on top of Crawl4AI and SearXNG.
This project is published as searxNcrawl at https://github.com/DasDigitaleMomentum/searxNcrawl and is maintained by DDM – Das Digitale Momentum GmbH & Co KG. It is the successor to searxng-mcp https://github.com/tisDDM/searxng-mcp (which should be marked deprecated).
Compared to plain Crawl4AI usage, searxNcrawl provides a proven, production-tested crawl configuration for documentation-heavy sites, optimized for clean, model-ready Markdown with less noise and better token efficiency.
It also includes built-in markdown deduplication and early support for authenticated crawling (WIP) via Playwright storage state — including a practical CDP export flow for real Chrome/Chromium login sessions.
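To make the idea of markdown deduplication concrete, here is a minimal sketch that drops repeated blocks from a markdown string. It is an illustration only: the function name `dedupe_markdown_blocks` and the hash-per-block strategy are assumptions, not searxNcrawl's actual implementation.

```python
import hashlib
import re


def dedupe_markdown_blocks(markdown):
    """Drop repeated blank-line-separated blocks, keeping first occurrences."""
    seen = set()
    kept = []
    for block in re.split(r"\n\s*\n", markdown.strip()):
        # Normalize whitespace and case so trivially different copies match.
        normalized = " ".join(block.split()).lower()
        key = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            kept.append(block.strip())
    return "\n\n".join(kept)


# A navigation line that appears twice, as often happens on crawled pages.
sample = "# Title\n\nNav: Home | Docs\n\nBody text.\n\nNav: Home | Docs\n"
print(dedupe_markdown_blocks(sample))
```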
python -m crawler.mcp_server --transport http --cors-origins "http://localhost:3000,https://myapp.com"
Minimal dependencies:
- crawl4ai>=0.7.4 - The underlying crawler engine
- tldextract>=5.1.2 - Domain parsing for site crawls
- playwright>=1.40.0 - Browser automation
- fastmcp>=2.0.0 - MCP server framework
- httpx>=0.27.0 - HTTP client for SearXNG

```bash
cd searxNcrawl
python -m venv .venv
source .venv/bin/activate
pip install -e .
playwright install chromium
```
crawl-mcp --transport http --port 8000
Create a .env file (see .env.example) and run:
docker compose up --build
The MCP HTTP port is configurable via MCP_PORT in .env. Default is 9555, so the server is available at http://localhost:9555/mcp.
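The port rule above can be sketched in a few lines of Python. The helper name `mcp_endpoint` is hypothetical, and the sketch assumes the `.env` values have already been loaded into a mapping (Docker Compose normally does that step).

```python
import os


def mcp_endpoint(env=None, host="localhost"):
    """Build the MCP HTTP URL from MCP_PORT, falling back to 9555."""
    env = os.environ if env is None else env
    port = int(env.get("MCP_PORT", "9555"))
    return f"http://{host}:{port}/mcp"


print(mcp_endpoint({}))                    # no override, default port
print(mcp_endpoint({"MCP_PORT": "8000"}))  # explicit override
```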
To run real‑world checks against the Docker setup (crawl, crawl_site, search), use:
scripts/test-realworld.sh
For extended tests including new features (remove_links, Unicode handling, schema validation):
scripts/test-extended.sh
mkdir -p ~/.config/searxncrawl
cp .env.example ~/.config/searxncrawl/.env
```
Crawled: 2025-01-09 12:00:00 UTC
[Page content as markdown...]
---
Crawled: 2025-01-09 12:00:01 UTC
[Page content as markdown...]
```
After installation (pip install -e .), the crawl and search commands are available globally.
1) Start Chrome with remote debugging enabled and a dedicated profile directory:
google-chrome --remote-debugging-port=9222 --user-data-dir="$HOME/.chrome-cdp-searxncrawl"
2) Log in manually to your target app in that browser.
3) List selectable sessions:
crawl-capture --cdp-url http://127.0.0.1:9222 --list-sessions
4) Export by explicit session index:
crawl-capture \
  --cdp-url http://127.0.0.1:9222 \
  --cdp-session 2 \
  --output ./state.json
Or let CLI selection guide you interactively:
crawl-capture \
  --cdp-url http://127.0.0.1:9222 \
  --list-sessions \
  --select \
  --output ./state.json
After capture/export, use the file for authenticated crawling:
crawl https://example.com/private --storage-state ./state.json
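Before reusing an exported file, it can help to check what it contains. The sketch below summarizes a Playwright storage state (a JSON object with `cookies` and `origins` lists) without printing any secret values. The helper name `summarize_storage_state` and the inline sample data are made up for illustration.

```python
import json
from collections import Counter


def summarize_storage_state(state):
    """Count cookies per domain and list origins, without exposing values."""
    cookies = state.get("cookies", [])
    per_domain = Counter(cookie.get("domain", "") for cookie in cookies)
    origins = [origin.get("origin", "") for origin in state.get("origins", [])]
    return {"cookies_per_domain": dict(per_domain), "origins": origins}


# Inline sample in the storage-state layout; a real run would load the
# exported file instead, e.g. json.load(open("state.json")).
sample_state = {
    "cookies": [
        {"name": "sid", "value": "secret", "domain": "example.com", "path": "/"},
        {"name": "csrf", "value": "secret", "domain": "example.com", "path": "/"},
    ],
    "origins": [{"origin": "https://example.com", "localStorage": []}],
}
print(json.dumps(summarize_storage_state(sample_state), indent=2))
```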
Explicit outcomes:
- success (exit 0): storage state written.
- timeout (exit 2): completion condition not reached in time (manual flow only).
- abort (exit 130): browser/session closed before completion (manual flow only).
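A wrapper script could branch on these exit codes roughly as follows. This is a sketch under assumptions: the names `describe_capture_exit` and `run_and_classify` are hypothetical, and the demo uses the Python interpreter as a stand-in for `crawl-capture` so that it runs anywhere.

```python
import subprocess
import sys

# Exit codes as documented above; anything else is treated as unknown.
OUTCOMES = {0: "success", 2: "timeout", 130: "abort"}


def describe_capture_exit(returncode):
    """Map a process exit code to the documented outcome name."""
    return OUTCOMES.get(returncode, "unknown")


def run_and_classify(argv):
    """Run a command and classify its exit code."""
    completed = subprocess.run(argv)
    return describe_capture_exit(completed.returncode)


# Stand-in process that exits with code 2, mimicking a capture timeout.
print(run_and_classify([sys.executable, "-c", "raise SystemExit(2)"]))
```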
Safety notes:
- Keep storage_state files out of version control.
- Capture/export is intentionally isolated from standard crawl / crawl_site execution paths.
- If multiple tabs share one browser context/profile, they share the same exported session state.
| Variable | Default | Description |
|---|---|---|
| SEARXNG_URL | http://localhost:8888 | SearXNG instance URL |
| SEARXNG_USERNAME | (none) | Optional basic auth username |
| SEARXNG_PASSWORD | (none) | Optional basic auth password |
SearXNG is a privacy-respecting metasearch engine that aggregates results from multiple search engines without tracking users. To use the search functionality of searxNcrawl, you need access to a SearXNG instance with the JSON output format enabled (json listed in settings.yml under search.formats).
You can either self-host a SearXNG instance or use a public one. For reliable results, self-hosting is recommended as public instances may have rate limits or restricted API access.
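For orientation, a SearXNG instance returns JSON when its `/search` endpoint is called with `format=json`. The sketch below only builds such a URL and sends no request. The helper name `build_search_url` is hypothetical, and this is not necessarily how searxNcrawl constructs its own requests.

```python
from urllib.parse import urlencode, urljoin


def build_search_url(base_url, query, **params):
    """Return a SearXNG search URL that asks for JSON results."""
    query_string = urlencode({"q": query, "format": "json", **params})
    # Ensure exactly one slash between the base URL and the endpoint name.
    return urljoin(base_url.rstrip("/") + "/", "search") + "?" + query_string


print(build_search_url("http://localhost:8888", "model context protocol"))
```

If the instance does not have the JSON format enabled, requests to such a URL are typically rejected, which is a quick way to check the prerequisite.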
The CLI tools (crawl, search) look for .env files in this order:
1. ./.env
2. ~/.config/searxncrawl/.env

If no .env is found and .env.example exists in the package, it will be automatically copied to ~/.config/searxncrawl/.env as a starting point.
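The lookup order can be pictured with a small sketch. The function name `find_env_file` is hypothetical and the auto-copy of `.env.example` is left out; the demo uses temporary directories instead of the real home directory.

```python
import tempfile
from pathlib import Path


def find_env_file(cwd, home):
    """Return the first existing .env in the documented order, else None."""
    candidates = [
        Path(cwd) / ".env",
        Path(home) / ".config" / "searxncrawl" / ".env",
    ]
    for candidate in candidates:
        if candidate.is_file():
            return candidate
    return None


with tempfile.TemporaryDirectory() as tmp:
    cwd, home = Path(tmp) / "project", Path(tmp) / "home"
    cwd.mkdir()
    (home / ".config" / "searxncrawl").mkdir(parents=True)
    (home / ".config" / "searxncrawl" / ".env").write_text("SEARXNG_URL=http://localhost:8888\n")
    found_global = find_env_file(cwd, home)  # only the global file exists
    (cwd / ".env").write_text("SEARXNG_URL=http://your-searxng:8888\n")
    found_local = find_env_file(cwd, home)   # the local file now takes priority
    print(found_global.parent.name, found_local.parent.name)
```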
Quick setup for global CLI usage:
```bash
export SEARXNG_URL=http://your-searxng:8888
```
#### CORS Configuration (HTTP Transport)
When using HTTP transport, browser-based MCP clients may need CORS (Cross-Origin Resource Sharing) headers. Use `--cors-origins` to enable them:
python -m crawler.mcp_server --transport http --cors-origins "http://localhost:3000,https://myapp.com"
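A `--cors-origins` value such as "http://localhost:3000,https://myapp.com" is a comma-separated list of allowed origins. The sketch below shows one plausible way to split such a value. The function name `parse_cors_origins` is hypothetical, and how the server parses the flag internally is an assumption.

```python
def parse_cors_origins(value):
    """Split a comma-separated origin list, dropping blanks and whitespace."""
    return [origin.strip() for origin in value.split(",") if origin.strip()]


origins = parse_cors_origins("http://localhost:3000,https://myapp.com")
print(origins)
```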
Add to your MCP client configuration (examples include Zed, opencode, antigravity, VS Code, Claude Code, Codex, OpenClaw, etc.):
{
  "mcpServers": {
    "crawler": {
      "command": "python",
      "args": ["-m", "crawler.mcp_server"],
      "cwd": "/path/to/searxNcrawl",
      "env": {
        "SEARXNG_URL": "http://your-searxng-instance:8888"
      }
    }
  }
}
Or with uv:
{
  "mcpServers": {
    "crawler": {
      "command": "uv",
      "args": ["run", "--directory", "/path/to/searxNcrawl", "python", "-m", "crawler.mcp_server"],
      "env": {
        "SEARXNG_URL": "http://your-searxng-instance:8888"
      }
    }
  }
}
OpenClaw is a popular autonomous AI agent (150k+ GitHub stars) that supports MCP natively. To integrate searxNcrawl with OpenClaw, add the following to your OpenClaw MCP config file (~/.clawdbot/mcp.json or openclaw.json):
Python with venv:
{
  "searxNcrawl": {
    "command": "python",
    "args": ["-m", "crawler.mcp_server"],
    "cwd": "/path/to/searxNcrawl",
    "env": {
      "SEARXNG_URL": "http://your-searxng-instance:8888"
    }
  }
}
With uv (no manual venv needed):
{
  "searxNcrawl": {
    "command": "uv",
    "args": ["run", "--directory", "/path/to/searxNcrawl", "python", "-m", "crawler.mcp_server"],
    "env": {
      "SEARXNG_URL": "http://your-searxng-instance:8888"
    }
  }
}
Docker HTTP endpoint:
If you prefer running searxNcrawl via Docker, start the server with:
docker compose up --build
Then configure OpenClaw to connect to the HTTP endpoint at http://localhost:9555/mcp.
Once configured, OpenClaw will have access to the crawl, crawl_site, and search tools.
The default configuration is optimized for documentation sites. For advanced customization:
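```python
import asyncio

from crawler import crawl_page_async, build_markdown_run_config, RunConfigOverrides

# Override selected defaults of the documentation-optimized run config.
config = build_markdown_run_config(
    RunConfigOverrides(
        delay_before_return_html=1.0,  # Wait longer for JS
        mean_delay=1.0,                # Delay between requests
        scan_full_page=True,
    )
)


async def main():
    # `await` is only valid inside a coroutine, so wrap the call.
    return await crawl_page_async("https://example.com", config=config)


doc = asyncio.run(main())
```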
- crawl - Crawl pages from the command line
- crawl-capture - Manual login capture + CDP session list/select/export
- search - Search the web via SearXNG

pip install -e .
crawl https://example.com --json
python -m crawler.mcp_server
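Feature-complete, with good code quality.
AI Skill Hub is a third-party content aggregation platform. The information on this page is compiled from public data and does not constitute a legal endorsement of any tool's functionality or quality.
We recommend validating the tool thoroughly in a sandbox or test environment before deploying it to production, and carrying out the necessary security assessment.
✅ MIT License: one of the most permissive open-source licenses. It allows free commercial use, modification, and distribution, and only requires that the copyright notice be retained.
Overall, SearxNcrawl is a solid, good-quality option among MCP tools. If you already have a clear use case, you can start using it right away; if you are still evaluating, compare it with similar tools before deciding.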
| Original name | searxNcrawl |
| Original description | Open-source MCP tool: MCP Server and CLI Tools for searxing and fetching websites. ⭐105 · Python |
| Topics | mcp, python, web crawler |
| GitHub | https://github.com/DasDigitaleMomentum/searxNcrawl |
| License | MIT |
| Language | Python |
Listed: 2026-05-28 · Updated: 2026-05-28 · License: MIT · AI Skill Hub does not legally vouch for the accuracy of third-party content.