AI Skill Hub 推荐使用:路由竞技场 是一款优质的Agent工作流。AI 综合评分 7.8 分,在同类工具中表现稳健。如果你正在寻找可靠的Agent工作流解决方案,这是一个值得深入了解的选择。
路由竞技场 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
路由竞技场 是一套完整的 AI Agent 自动化工作流方案。通过可视化的节点编排,将复杂的多步骤任务拆解为清晰的自动化流程,实现全程无人值守的智能处理。支持与数百种外部服务和 API 无缝集成,适合构建数据处理管线、业务自动化和 AI 辅助决策系统。
# 方式一:pip 安装(推荐)
pip install routerarena
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install routerarena
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/RouteWorks/RouterArena
cd RouterArena
pip install -e .
# 验证安装
python -c "import routerarena; print('安装成功')"
# 命令行使用
routerarena --help
# 基本用法
routerarena input_file -o output_file
# Python 代码中调用
import routerarena
# 示例
result = routerarena.process("input")
print(result)
# routerarena 配置文件示例(config.yml) app: name: "routerarena" debug: false log_level: "INFO" # 运行时指定配置文件 routerarena --config config.yml # 或通过环境变量配置 export ROUTERARENA_API_KEY="your-key" export ROUTERARENA_OUTPUT_DIR="./output"
<br> <p> <a href="https://huggingface.co/blog/JerryPotter/who-routes-the-routers"><img alt="Blog" src="https://img.shields.io/badge/Blog-Read-FF5722?logo=rss&logoColor=white&labelColor=555555"></a> <a href="https://arxiv.org/abs/2510.00202"><img alt="arXiv: RouterArena" src="https://img.shields.io/badge/arXiv-RouterArena-b31b1b?logo=arxiv&logoColor=white&labelColor=555555"></a> <a href="https://huggingface.co/datasets/RouteWorks/RouterArena"><img alt="Hugging Face Dataset" src="https://img.shields.io/badge/%20Hugging%20Face-Dataset-yellow?logo=huggingface&logoColor=white&labelColor=555555"></a> <br> </p>
</div>
<p align="center"> <img src="images/routerarena-diagram.png" alt="RouterArena Diagram" width="700" /> </p>
RouterArena is an open evaluation platform and leaderboard for LLM routers—systems that automatically select the best model for a given query. As the LLM ecosystem diversifies with models varying in size, capability, and cost, routing has become critical for balancing performance and cost. Yet, LLM routers currently lack a standardized evaluation framework to assess how effectively they trade off accuracy, cost, and other related metrics.
RouterArena bridges this gap by providing an open evaluation platform and benchmarking framework for both open-source and commercial routers. It has the following key features:
We aim for RouterArena to serve as a foundation for the community to evaluate, understand, and advance LLM routing systems.
[!IMPORTANT] RouterArena is an evaluation-only dataset. Submissions that train, fit, or tune any router component on RouterArena data (including the label files) will be rejected, and any accepted submission found in violation will be withdrawn.
curl -LsSf https://astral.sh/uv/install.sh | sh
cd RouterArena
uv sync
OPENAI_API_KEY=<Your-Key> ANTHROPIC_API_KEY=<Your-Key>
In the project root, copy .env.example as .env and update the API keys in .env. This step is required only if you use our pipeline for LLM inferences.
```bash
Create a config file in ./router_inference/config/<router_name>.json. An example config file is included here.
{
"pipeline_params": {
"router_name": "your-router",
"router_cls_name": "your_router_class_name",
"models": [
"gpt-4o-mini",
"claude-3-haiku-20240307",
"gemini-2.0-flash-001"
]
}
}
For each model in your config, add an entry with the pricing per million tokens in this format at model_cost/model_cost.json:
{
"gpt-4o-mini": {
"input_token_price_per_million": 0.15,
"output_token_price_per_million": 0.6
},
}
[!NOTE] Ensure all models in your above config files are listed in./universal_model_names.py. If you add a new model, you must also add the API inference endpoint inllm_inference/model_inference.py.
uv run python ./router_inference/check_config_prediction_files.py your-router [sub_10|full|robustness]
This script checks: (1) all model names are valid, (2) prediction file has correct size (809 for sub_10, 8400 for full, 420 for robustness), and (3) all entries have valid global_index, prompt, and prediction fields.
专业的LLM路由评估框架,提供标准化基准工具,代码维护活跃,社区认可度中等,适合学术和生产环境评估使用。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ Apache 2.0 — 宽松开源协议,可商用,需保留版权声明和 NOTICE 文件,含专利授权条款。
总体来看,路由竞技场 是一款质量良好的Agent工作流,在同类工具中具备一定竞争力。AI Skill Hub 将持续追踪其更新动态,建议收藏备用,结合自身场景选择合适时机引入使用。
| 原始名称 | RouterArena |
| 原始描述 | 开源AI工作流:RouterArena: An open framework for evaluating LLM routers with standardized data。⭐78 · Python |
| Topics | LLM路由工作流多智能体模型评估基准测试 |
| GitHub | https://github.com/RouteWorks/RouterArena |
| License | Apache-2.0 |
| 语言 | Python |
收录时间:2026-05-21 · 更新时间:2026-05-22 · License:Apache-2.0 · AI Skill Hub 不对第三方内容的准确性作法律背书。
选择 Agent 类型,复制安装指令后粘贴到对应客户端