kreuzberg MCP工具 是 AI Skill Hub 本期精选AI工具之一。已获得 8.3k 颗 GitHub Star,综合评分 8.2 分,整体质量较高。我们强烈推荐将其纳入你的 AI 工具库,帮助提升工作效率。
基于Rust核心的多语言文档智能框架,支持文本提取、元数据识别等功能。通过FFI与C#、Elixir等语言集成,适合需要高性能文档处理的开发者和企业应用。
kreuzberg MCP工具 是一款基于 Rust 开发的开源工具,专注于 文档处理、Rust核心、多语言FFI 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
基于Rust核心的多语言文档智能框架,支持文本提取、元数据识别等功能。通过FFI与C#、Elixir等语言集成,适合需要高性能文档处理的开发者和企业应用。
kreuzberg MCP工具 是一款基于 Rust 开发的开源工具,专注于 文档处理、Rust核心、多语言FFI 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
# 方式一:cargo install(推荐) cargo install kreuzberg # 方式二:从源码编译 git clone https://github.com/kreuzberg-dev/kreuzberg cd kreuzberg cargo build --release # 二进制在 ./target/release/kreuzberg
# 查看帮助 kreuzberg --help # 基本运行 kreuzberg [options] <input> # 详细使用说明请查阅文档 # https://github.com/kreuzberg-dev/kreuzberg
# kreuzberg 配置说明 # 查看配置选项 kreuzberg --config-example > config.yml # 常见配置项 # output_dir: ./output # log_level: info # workers: 4 # 环境变量(覆盖配置文件) export KREUZBERG_CONFIG="/path/to/config.yml"
One Rust engine — 96 file formats, 306 programming languages, native bindings for 16 languages, dual model runtimes, 6 output formats, OCR from any backend, embeddings, structured LLM extraction, token reduction, and more.
Xberg is the next iteration of Kreuzberg. Same document-intelligence engine, rebuilt and rebranded under a fresh v1 line.
Feed documents → get clean text, tables, metadata, transcripts, code intelligence · Run it library, CLI, REST API, or MCP server · No GPU needed · Stream multi-GB files · Cache results.
Documents · Images · Spreadsheets · Email · Archives · Code · Audio · Video
Quick start · What you get · Capabilities · CLI · Docs
</div>
---
<p align="center"><img src="docs/assets/demos/extract.gif" alt="Extracting clean Markdown from a PDF in the CLI" width="820"></p> <p align="center"><em>Feed any document—get structured text. Extract, batch, stream, or crawl.</em></p>
---
<details> <summary><strong>Full feature list</strong></summary>
<details> <summary><strong>CLI Tool</strong></summary>
brew install xberg-io/tap/xberg
12 commands: extract, batch, detect, formats, version, cache (stats/clear/manifest/warm), serve, mcp, api, embed, chunk, completions.
See CLI usage guide for detailed documentation.
</details>
<details> <summary><strong>Docker</strong></summary>
docker pull ghcr.io/xberg-io/xberg:latest
Run in API, CLI, or MCP modes. See Docker guide for examples.
</details>
<details> <summary><strong>REST API Server</strong></summary>
xberg serve --host 0.0.0.0 --port 8000
One POST endpoint handles all formats. Returns JSON or Markdown. Stream large files. See API server guide.
</details>
<details> <summary><strong>MCP Server</strong></summary>
xberg mcp --transport stdio
9 tools (extract, extract_batch, detect_mime_type, cache_stats, list_formats, cache_clear, get_version, cache_manifest, cache_warm). 3 prompts (extract_document, extract_with_ocr, semantic_search). 4 resources (formats, models, OCR languages, embedding presets).
Add to Claude Desktop or Cursor:
{
"mcpServers": {
"xberg": { "command": "xberg", "args": ["mcp"] }
}
}
</details>
| Mode | Command | Transport | Use case |
|---|---|---|---|
| **Library** | xberg::extract() | Async functions | Embed in your application |
| **CLI** | xberg extract document.pdf | 12 commands | Scripts, batch jobs, CI/CD |
| **REST API** | xberg serve | HTTP POST | Microservice, serverless deployment |
| **MCP Server** | xberg mcp | stdio or HTTP | Claude, Cursor, IDE agents |
| **Docker** | docker run ghcr.io/xberg-io/xberg | All modes | Container deployment |
Extract text from a document:
use xberg::{extract, ExtractInput, ExtractionConfig};
#[tokio::main]
async fn main() -> xberg::Result<()> {
let config = ExtractionConfig::default();
let output = extract(
ExtractInput::from_uri("document.pdf"),
&config
).await?;
println!("{}", output.results[0].content);
Ok(())
}
Common use cases — see Quick start guide for language-specific examples, OCR, batch processing, and API configuration.
---
<p align="center"><img src="docs/assets/demos/cli.gif" alt="Xberg CLI: extract, batch, detect, formats, cache, serve, mcp" width="760"></p> <p align="center"><em>The CLI: 12 commands for extraction, caching, serving, and MCP.</em></p>
<p align="center"><img src="docs/assets/demos/ocr.gif" alt="OCR from a scanned image with confidence scores and bounding boxes" width="820"></p> <p align="center"><em>OCR with confidence scores and bounding boxes. Switch backends without code changes.</em></p>
<p align="center"><img src="docs/assets/demos/crawl.gif" alt="Crawling a website and extracting all linked documents" width="820"></p> <p align="center"><em>Web crawl: fetch a page, follow links, extract all documents recursively.</em></p>
<p align="center"><img src="docs/assets/demos/mcp.gif" alt="MCP server integration with Claude Desktop showing extraction tools and prompts" width="820"></p> <p align="center"><em>MCP server: AI agents extract documents, detect formats, warm models, manage cache.</em></p>
<p align="center"><img src="docs/assets/demos/serve.gif" alt="REST API: POST a document, get JSON extraction results with streaming support" width="820"></p> <p align="center"><em>REST API: stream large files, get JSON or Markdown, one endpoint for all formats.</em></p>
---
<details> <summary><strong>All 12 commands</strong></summary>
| Command | Subcommands | Purpose |
|---|---|---|
extract | — | Extract text from a single document (path, URL, or stdin) |
batch | — | Extract from multiple documents in parallel |
detect | — | Identify MIME type of a file |
formats | — | List all 96 supported formats and MIME types |
version | — | Show Xberg version |
cache | stats, clear, manifest, warm | Manage extraction cache and models |
serve | — | Start REST API server (default: http://127.0.0.1:8000) |
mcp | — | Start MCP server (stdio or HTTP transport) |
api | schema | Output OpenAPI 3.1 specification |
embed | — | Generate embeddings for text (local or provider-hosted) |
chunk | — | Split text into chunks (text, markdown, YAML, or semantic) |
completions | — | Generate shell completion scripts |
Run xberg --help or xberg <command> --help for detailed options.
</details>
---
<details open> <summary><strong>Python</strong></summary>
pip install xberg
See Python README for full documentation.
</details>
<details> <summary><strong>Node.js / TypeScript</strong></summary>
npm install @xberg-io/xberg
See Node.js README for full documentation.
</details>
<details> <summary><strong>Rust</strong></summary>
cargo add xberg
See Rust README for full documentation.
</details>
<details> <summary><strong>Go</strong></summary>
go get github.com/xberg-io/xberg
See Go README for full documentation.
</details>
<details> <summary><strong>Java</strong></summary>
Available on Maven Central as io.xberg:xberg. See Java README for the dependency snippet.
</details>
<details> <summary><strong>C#</strong></summary>
dotnet add package Xberg
See C# README for full documentation.
</details>
<details> <summary><strong>Ruby</strong></summary>
gem install xberg
See Ruby README for full documentation.
</details>
<details> <summary><strong>PHP</strong></summary>
composer require xberg-io/xberg
See PHP README for full documentation.
</details>
<details> <summary><strong>Elixir</strong></summary>
Add {:xberg, "~> 1.0"} to your mix.exs dependencies. See Elixir README for full documentation.
</details>
<details> <summary><strong>WebAssembly</strong></summary>
npm install @xberg-io/xberg-wasm
See WebAssembly README for full documentation.
</details>
<details> <summary><strong>R</strong></summary>
Install from r-universe. See R README for full documentation.
</details>
<details> <summary><strong>Kotlin (Android)</strong></summary>
Available on Maven Central as io.xberg:xberg-android. See Kotlin README for the dependency snippet.
</details>
<details> <summary><strong>Swift</strong></summary>
Add via Swift Package Manager. See Swift README for full documentation.
</details>
<details> <summary><strong>Dart / Flutter</strong></summary>
dart pub add xberg
See Dart README for full documentation.
</details>
<details> <summary><strong>Zig</strong></summary>
Add via zig fetch. See Zig README for full documentation.
</details>
<details> <summary><strong>C/C++ (FFI)</strong></summary>
Build from source as part of this workspace. See C (FFI) README for full documentation.
</details>
成熟的���档智能框架,Rust核心保证性能,多语言支持扩展性强。Star数8.3k表明社区认可度高,维护活跃。
该工具使用 NOASSERTION 协议,商用场景请仔细阅读协议条款,必要时咨询法律意见。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
📄 NOASSERTION — 请查阅原始协议条款了解具体使用限制。
经综合评估,kreuzberg MCP工具 在AI工具赛道中表现稳健,质量优秀。如果你已有明确的使用需求,可以直接上手体验;如果还在评估阶段,建议对比同类工具后再做决策。
| 原始名称 | kreuzberg |
| 原始描述 | 开源MCP工具:A polyglot document intelligence framework with a Rust core. Extract text, metad。⭐8.3k · Rust |
| Topics | 文档处理Rust核心多语言FFIMCP工具高性能 |
| GitHub | https://github.com/kreuzberg-dev/kreuzberg |
| License | NOASSERTION |
| 语言 | Rust |
收录时间:2026-05-14 · 更新时间:2026-05-16 · License:NOASSERTION · AI Skill Hub 不对第三方内容的准确性作法律背书。