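Following AI Skill Hub's curated evaluation, Browser Pilot is rated "Recommended". The tool scores well on feature completeness, community activity, and ease of use, with an AI score of 7.5, and it suits users who have some technical background.
Browser Pilot is an open-source tool written in Python that focuses on browser automation, AI agents, and workflows. As an open-source project on GitHub, it has community support and ongoing releases, its code is fully open to audit, and it supports local deployment to protect data privacy. It can be used on its own or integrated into an enterprise workflow.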
# Option 1: install with pip (recommended)
pip install browser-pilot
# Option 2: install in a virtual environment (recommended for production)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install browser-pilot
# Option 3: install from source (to get the latest features)
git clone https://github.com/NoDeskAI/browser-pilot
cd browser-pilot
pip install -e .
# Verify the installation
python -c "import browser_pilot; print('Installation succeeded')"
# Command-line usage
browser-pilot --help
# Basic usage
browser-pilot input_file -o output_file
# Calling from Python code
import browser_pilot
# Example
result = browser_pilot.process("input")
print(result)
# browser-pilot configuration file example (config.yml)
app:
  name: "browser-pilot"
  debug: false
  log_level: "INFO"
# Specify the configuration file at run time
browser-pilot --config config.yml
# Or configure through environment variables
export BROWSER_PILOT_API_KEY="your-key"
export BROWSER_PILOT_OUTPUT_DIR="./output"
Requires Docker (with Compose v2).

```bash
git clone https://github.com/NoDeskAI/browser-pilot.git
cd browser-pilot
cp .env.example .env
docker compose build && docker compose up -d
```

Open http://localhost:8000 to see the web UI with session management and a live browser viewer (noVNC).
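
Once the Compose stack has been started, the web UI on http://localhost:8000 may take a while to come up. The following is a minimal sketch of a readiness check that polls a URL until the server answers. The helper name `wait_for_http` and its timeout values are illustrative assumptions, not part of browser-pilot.

```python
import time
import urllib.error
import urllib.request


def wait_for_http(url: str, timeout_s: float = 60.0, interval_s: float = 2.0) -> bool:
    """Poll `url` until it returns any HTTP response or `timeout_s` elapses.

    Hypothetical helper for illustration; browser-pilot does not ship it.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            with urllib.request.urlopen(url, timeout=5):
                return True
        except urllib.error.HTTPError:
            # The server replied with an error status, so it is listening.
            return True
        except (urllib.error.URLError, OSError):
            # Connection refused or timed out: the service is not ready yet.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)
```

Calling `wait_for_http("http://localhost:8000")` after `docker compose up -d` returns `True` as soon as the backend responds and `False` if nothing answers within the timeout.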
| Variable | Default | Description |
|---|---|---|
| DATABASE_URL | Required in .env; see .env.example | PostgreSQL connection string for local backend development. Keep it aligned with POSTGRES_*. |
| EDITION | ce in Docker Compose; auto-detected by start.sh when no edition argument is provided | Product edition. Use ce for Community Edition or ee for Enterprise Edition. EE requires the ee/ sources to be present. |
| POSTGRES_USER | Required in .env; see .env.example | PostgreSQL user used by Docker Compose and local development. |
| POSTGRES_PASSWORD | Required in .env; see .env.example | PostgreSQL password. Change it before production/public deployment. |
| POSTGRES_DB | Required in .env; see .env.example | PostgreSQL database name. |
| MINIO_ROOT_USER | Required in .env; see .env.example | Root user for the bundled S3-compatible storage service. |
| MINIO_ROOT_PASSWORD | Required in .env; see .env.example | Root password for the bundled S3-compatible storage service. Change it before production/public deployment. |
| MINIO_BUCKET | Required in .env; see .env.example | Bucket created automatically by Docker Compose and preconfigured as the default S3 storage bucket. |
| MINIO_ENDPOINT | http://localhost:9000 for start.sh; container-internal endpoint in Docker Compose | Endpoint used by the backend to reach the bundled S3-compatible storage service. |
| MINIO_PUBLIC_ENDPOINT | http://localhost:9000 in Docker Compose | Public endpoint embedded in S3 signed download URLs. It must be reachable by browsers and CLI clients. |
| SELENIUM_BASE_IMAGE | selenium/standalone-chrome:latest | Base image for browser containers. ARM users: seleniarm/standalone-chromium:latest |
| BROWSER_GL_MODE | auto | Browser WebGL runtime mode: auto, swiftshader, angle-swiftshader, angle, egl, or native. auto resolves to angle-swiftshader for ARM Chromium and swiftshader elsewhere. |
| DOCKER_HOST_ADDR | localhost | How the backend reaches browser containers. Set to host.docker.internal in Docker deployment (auto-configured by docker-compose). |
| BROWSER_RUNTIME_BACKEND_URL | http://host.docker.internal:8000 | Backend URL injected into browser runtime agents for internal file ingest callbacks. |
| BROWSER_RUNTIME_CONTROL_URL | (none) | Optional internal runtime-worker URL. Docker Compose sets this to http://runtime-worker:8001 so the public backend does not mount Docker socket directly. |
| BROWSER_RUNTIME_CONTROL_TOKEN | (none) | Shared bearer token used between backend and runtime-worker. Set a long random value before production/public deployment. |
| BROWSER_RUNTIME_COMMAND_MAX_TIMEOUT | 3600 | Maximum timeout, in seconds, accepted for runtime-worker Docker commands. Large first-time runtime image builds can need a longer timeout. |
| CLOAK_BROWSER_IMAGE_NAME | browser-pilot-cloak:latest | Optional Cloak Chromium runtime image used by sessions created with browserRuntime=cloak_chromium. |
| BROWSER_HOME_URL | https://www.google.com/ | Home page opened automatically when a newly started browser is still on a blank/new-tab page. Set empty to disable. |
| BP_LEGACY_DOCKER_DOWNLOAD_WATCHER | false | Temporary fallback for old Selenium images without file-capture-agent. When enabled, backend uses Docker copy commands and reports a degraded warning. |
| OPENAI_API_KEY | (none) | Optional. When set, uses LLM to auto-name sessions on first navigation. Without it, sessions are named by page title. |
| LOG_LEVEL | INFO | Backend log verbosity. Set to DEBUG for troubleshooting. |
| JWT_EXPIRE_MINUTES | 30 | Short-lived access JWT lifetime in minutes. |
| REMEMBER_ME_DAYS | 7 | Duration for the revocable remember-me cookie used to restore short-lived access tokens. |
| NETWORK_EGRESS_DOCKER_NETWORK | browser-pilot-net | Docker bridge network used by browser containers and managed egress containers. |
| NETWORK_EGRESS_CONFIG_DIR | data/network-egress | Private config storage for managed Clash/OpenVPN egress profiles. |
| NETWORK_EGRESS_CLASH_IMAGE | ghcr.io/metacubex/mihomo:latest | Container image used for managed Clash egress profiles. |
| NETWORK_EGRESS_CLASH_PROXY_PORT | 7890 | Proxy port exposed by managed Clash containers on the internal Docker network. |
| NETWORK_EGRESS_OPENVPN_IMAGE | browser-pilot-openvpn-egress:latest | Container image used for managed OpenVPN egress profiles. The default image is built from services/network-egress-openvpn on first use. |
| NETWORK_EGRESS_OPENVPN_PROXY_PORT | 8888 | HTTP proxy port exposed by managed OpenVPN containers on the internal Docker network. |
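
Several variables in the table are marked as required in `.env`, and `DATABASE_URL` should stay aligned with the `POSTGRES_*` values. Below is a rough sketch of a pre-flight check for both points. It assumes `DATABASE_URL` has the usual `scheme://user:password@host:port/dbname` shape, which the source does not state, and the function names `parse_env` and `check_env` are hypothetical.

```python
from urllib.parse import urlsplit

# Variables the table marks as "Required in .env".
REQUIRED = (
    "DATABASE_URL", "POSTGRES_USER", "POSTGRES_PASSWORD", "POSTGRES_DB",
    "MINIO_ROOT_USER", "MINIO_ROOT_PASSWORD", "MINIO_BUCKET",
)


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("\"'")
    return values


def check_env(values: dict[str, str]) -> list[str]:
    """Return a list of problems; an empty list means none were found."""
    problems = [f"{name} is missing or empty" for name in REQUIRED if not values.get(name)]
    url = values.get("DATABASE_URL")
    if url:
        # Assumption: user, password and database name are embedded in the URL.
        # Percent-encoded passwords are compared as written, without decoding.
        parts = urlsplit(url)
        pairs = (
            ("POSTGRES_USER", parts.username),
            ("POSTGRES_PASSWORD", parts.password),
            ("POSTGRES_DB", parts.path.lstrip("/")),
        )
        for name, from_url in pairs:
            expected = values.get(name)
            if expected and from_url != expected:
                problems.append(f"DATABASE_URL does not match {name}")
    return problems
```

A typical use would be `check_env(parse_env(open(".env").read()))` before running `docker compose up -d`, printing each returned problem.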
export BP_OMNIPARSER_URL=http://127.0.0.1:8001
export BP_OMNIPARSER_REPO=/path/to/OmniParser
Example OmniParser server launch from an upstream clone:
```bash
cd /path/to/OmniParser/omnitool/omniparserserver
python omniparserserver.py \
  --host 127.0.0.1 \
  --port 8001 \
  --device cpu \
  --som_model_path ../../weights/icon_detect/model.pt \
  --caption_model_name florence2 \
  --caption_model_path ../../weights/icon_caption_florence \
  --BOX_TRESHOLD 0.05
```
OmniParser code and weights are not vendored in this repository. Check the OmniParser model licenses before redistribution; its icon detection weights inherit the YOLO license noted by the upstream project.
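
The OmniParser launch shown earlier hard-codes the same host and port that `BP_OMNIPARSER_URL` points to. As a sketch, the command could instead be derived from the two `BP_OMNIPARSER_*` values so they cannot drift apart. The helper `omniparser_command` is hypothetical, and treating the URL's host and port as the server's `--host` and `--port` is an assumption; the flags themselves are copied from the example launch.

```python
from pathlib import Path
from urllib.parse import urlsplit


def omniparser_command(url: str, repo: str, device: str = "cpu") -> tuple[Path, list[str]]:
    """Build the working directory and argv for the OmniParser server.

    Hypothetical helper: `url` plays the role of BP_OMNIPARSER_URL and
    `repo` the role of BP_OMNIPARSER_REPO.
    """
    parts = urlsplit(url)
    if not parts.hostname or parts.port is None:
        raise ValueError("URL must include an explicit host and port")
    workdir = Path(repo) / "omnitool" / "omniparserserver"
    argv = [
        "python", "omniparserserver.py",
        "--host", parts.hostname,
        "--port", str(parts.port),
        "--device", device,
        "--som_model_path", "../../weights/icon_detect/model.pt",
        "--caption_model_name", "florence2",
        "--caption_model_path", "../../weights/icon_caption_florence",
        "--BOX_TRESHOLD", "0.05",  # flag spelled as in the example launch
    ]
    return workdir, argv
```

The result can be passed to `subprocess.run(argv, cwd=workdir, check=True)`, with `url` and `repo` read from `os.environ`.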
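An innovative browser-as-infrastructure design that gives AI agents reliable web interaction. Its isolated sessions and fingerprinting features stand out, but the project's maturity remains to be seen and its documentation is not yet complete.
AI Skill Hub is a third-party content aggregation platform. The information on this page is compiled from public data and does not constitute any legal endorsement of the tool's functionality or quality.
We recommend validating the tool thoroughly in a sandbox or test environment before deploying it to production, and carrying out the necessary security assessment.
✅ Apache 2.0: a permissive open-source license that allows commercial use, requires retaining the copyright notice and NOTICE file, and includes a patent grant clause.
AI Skill Hub review: Browser Pilot's core functionality is complete and its quality is good. For AI enthusiasts, it is worth adding to a personal toolkit. Try it in a non-production environment first, then roll it out gradually.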
| Original name | browser-pilot |
| Original description | Open-source AI workflow: Browser-as-infrastructure for AI agents. Isolated Chrome sessions with fingerpri... ⭐30 · Python |
| Topics | browser automation, AI agents, workflow, Docker, Selenium |
| GitHub | https://github.com/NoDeskAI/browser-pilot |
| License | Apache-2.0 |
| Language | Python |
Listed: 2026-05-29 · Updated: 2026-05-30 · License: Apache-2.0 · AI Skill Hub makes no legal endorsement of the accuracy of third-party content.