经 AI Skill Hub 精选评估,Cube AI 获评「推荐使用」。这款AI工具在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 7.5 分,适合有一定技术背景的用户使用。
Cube AI 是一款基于 Go 开发的开源工具,专注于 AI、LLM、TEE 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
Cube AI 是一款基于 Go 开发的开源工具,专注于 AI、LLM、TEE 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
# 方式一:go install(推荐) go install github.com/ultravioletrs/cube@latest # 方式二:从源码编译 git clone https://github.com/ultravioletrs/cube cd cube go build -o cube . # 方式三:下载预编译二进制 # 访问 Releases 页面下载对应平台二进制文件 # https://github.com/ultravioletrs/cube/releases
# 查看帮助 cube --help # 基本运行 cube [options] <input> # 详细使用说明请查阅文档 # https://github.com/ultravioletrs/cube
# cube 配置说明 # 查看配置选项 cube --config-example > config.yml # 常见配置项 # output_dir: ./output # log_level: info # workers: 4 # 环境变量(覆盖配置文件) export CUBE_CONFIG="/path/to/config.yml"
Cube AI is a framework for building GPT-based applications using confidential computing. It protects user data and AI models with a trusted execution environment (TEE), which is a secure area of the processor that ensures code and data loaded inside it remain confidential and intact. This provides strong data confidentiality and code integrity even when the host environment is not fully trusted.
<p align="center"> <img src="https://raw.githubusercontent.com/ultravioletrs/cube-docs/refs/heads/main/static/img/cube-ai.png"> </p>
Cube ships two self-contained Docker stacks:
| Stack | Path | Use | Includes |
|---|---|---|---|
| **local** | docker/local | Day-to-day development | ATOM identity, Ollama, agent, proxy, embedder (RAG), web UI |
| **prod** | docker/prod | Production | Everything above **plus** Traefik TLS, guardrails, audit pipeline, image embedder, optional attestation and Cloudflare tunnel |
The local stack is intentionally minimal: no attestation (TEE), no audit logs, no guardrails, no reverse proxy. Services are reachable directly on localhost ports.
make up-prod # start make logs-prod # follow logs make down-prod # stop ```
git clone https://github.com/ultravioletrs/cube.git
cd cube
The local stack runs the proxy, agent, and embedder images built from this repo, so build them first:
make docker-proxy docker-agent docker-embedder
This tags ghcr.io/ultravioletrs/cube/{proxy,agent,embedder}:latest (and :<version>), exactly what the compose files reference. Use make dockers to build every service image (including guardrails, image-embedder, and ui).
make up # start
make logs # follow logs
make down # stop
make clean-volumes # stop and wipe databases, models, uploads
The first start pulls Ollama and ATOM images and downloads the llama3.2:3b and nomic-embed-text models — give it a few minutes.
Local access (direct ports, plain HTTP):
| Service | URL | | ------------------------ | ----------------------------- | | Cube web UI | http://localhost:5173 | | ATOM identity UI | http://localhost:3005 | | Cube proxy (API gateway) | http://localhost:8900 | | Cube embedder (RAG) | http://localhost:8082 | | ATOM API (GraphQL) | http://localhost:8080/graphql |
Open http://localhost:5173. The local ATOM has self-signup enabled, so create an account (email + password) from the login screen — email verification is skipped in local dev. After signing in, create a workspace (domain) and start chatting. Uploading documents in the UI exercises the RAG pipeline through the embedder.
For frontend hot-reload development, see the UI development guide.
Get a token from ATOM via GraphQL (use the account you created):
curl -s http://localhost:8080/graphql \
-H "Content-Type: application/json" \
-d '{"query":"mutation($i:LoginInput!){login(input:$i){token entityId}}","variables":{"i":{"identity":"you@example.com","secret":"your-password","kind":"password"}}}'
The token is your bearer token. Send chat requests through the proxy, with your workspace (domain) ID in the path:
curl -s http://localhost:8900/YOUR_DOMAIN_ID/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_TOKEN" \
-d '{"model":"llama3.2:3b","messages":[{"role":"user","content":"Hello!"}]}'
Inference and RAG go through the cube-proxy gateway. Every protected request needs an Authorization: Bearer <token> header (token from ATOM — see step 5 of the Quick Start). The workspace (domain) ID goes in the path.
http://localhost:8900https://<your-domain> (Traefik, under /proxy)Replace {domainID} with your workspace ID.
| Method | Path | Description |
|---|---|---|
| GET | /{domainID}/v1/models | List available models |
| POST | /{domainID}/v1/chat/completions | Create chat completions |
| POST | /{domainID}/v1/completions | Create text completions |
| GET | /{domainID}/api/tags | List Ollama models |
| POST | /{domainID}/api/chat | Chat completions |
Document ingestion, sources, and retrieval are served by the embedder under /{domainID}/api/v1/... (records, sources, conversations, retrieve, chat, models). The web UI drives these; see the embedder runbook for the raw API.
Cube AI now supports vLLM, a high-throughput and memory-efficient inference engine for Large Language Models. vLLM provides:
Cube AI integrates with Ollama for local model deployment, providing:
- Local (docker/local/.env, docker/local/config.json) — Ollama models, ports, and database credentials. No TEE, audit, or guardrails. - Prod (docker/prod/.env, docker/prod/config.json) — set CUBE_DOMAIN, replace every change-me-* secret, and toggle ATTESTED_TLS. Guardrails, the audit pipeline, and the image embedder are always on in prod.
Cube AI 是一个基于保密计算的框架,用于构建 GPT 类型的应用程序。它保护用户数据和 AI 模型的安全性,使用受信任执行环境(TEE),确保代码和数据在其内保持保密和完整。即使主机环境不完全可信,Cube AI 也能提供强大的数据保密性和代码完整性。
Cube AI 的关键功能包括:受信任执行环境(TEE),确保 AI 模型在受控、隔离的环境中执行,保护提示、响应和模型数据,即使主机操作系统被破坏也能保护;AI 安全防护栏,包括 NeMo 防护栏,用于输入和输出验证,包括防止逃逸和提示注入检测,离题过滤,毒性检查等。
Cube AI 的环境依赖和系统要求包括:Docker 和 Docker Compose,约 8 GB 的可用 RAM(Ollama 模型 + Postgres 实例),TEE 硬件(AMD SEV-SNP / Intel TDX)仅在生产堆栈中用于 attestation,开发环境中不需要。
Cube AI 支持两种自包含的 Docker 堆栈:local 和 prod。local 堆栈用于日常开发,包括 ATOM 身份、Ollama、代理、嵌入器(RAG)和 web UI。prod 堆栈用于生产环境,需要编辑 docker/prod/.env 文件(CUBE_DOMAIN、密钥)并执行 make up-prod、make logs-prod 和 make down-prod 等命令。
使用 Cube AI 的快速入门指南包括:克隆仓库、构建 Cube 图像、启动代理、代理和嵌入器等。
Cube AI 的配置说明包括:MCP、env 和关键参数等。
Cube AI 的 API 端点包括: inference 和 RAG 通过 cube-proxy 网关,需要 Authorization: Bearer <token> 头(来自 ATOM),workspace(域)ID 在路径中,local base URL 和 prod base URL 等。
Cube AI 支持 vLLM 和 Ollama 集成,vLLM 提供高吞吐量和内存效率的推理引擎,Ollama 提供本地模型部署、模型管理和部署、本地推理等功能。
Cube AI是一个开源的AI工具,使用TEE保护LLM,具有较高的安全性
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ Apache 2.0 — 宽松开源协议,可商用,需保留版权声明和 NOTICE 文件,含专利授权条款。
AI Skill Hub 点评:Cube AI 的核心功能完整,质量良好。对于AI 技术爱好者来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。
| 原始名称 | cube |
| 原始描述 | 开源AI工具:Cube AI - LLM protection with TEEs。⭐28 · Go |
| Topics | AILLMTEEGo |
| GitHub | https://github.com/ultravioletrs/cube |
| License | Apache-2.0 |
| 语言 | Go |
收录时间:2026-06-10 · 更新时间:2026-06-11 · License:Apache-2.0 · AI Skill Hub 不对第三方内容的准确性作法律背书。