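AI Skill Hub recommends: Open-Source AI Workflow: List of Permanent Free LLM APIs, a well-regarded Agent workflow. It has 4.5k GitHub stars and an AI composite score of 7.5, a solid showing among comparable tools. If you are looking for a reliable Agent workflow solution, it is worth a closer look.
A collection of permanently free LLM APIs (API keys) for developers to use and learn from.
Open-Source AI Workflow: List of Permanent Free LLM APIs is a complete AI Agent automation workflow solution. Through visual node orchestration, it breaks complex multi-step tasks into clear automated flows that run fully unattended. It integrates with hundreds of external services and APIs, and suits data-processing pipelines, business automation, and AI-assisted decision systems.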
# Option 1: global install via npm
npm install -g awesome-free-llm-apis
# Option 2: run directly with npx (no install needed)
npx awesome-free-llm-apis --help
# Option 3: install as a project dependency
npm install awesome-free-llm-apis
# Option 4: run from source
git clone https://github.com/mnfst/awesome-free-llm-apis
cd awesome-free-llm-apis
npm install
npm start
# Command-line usage
awesome-free-llm-apis --help
# Basic usage
awesome-free-llm-apis [options] <input>
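// Usage from Node.js code
const awesome_free_llm_apis = require('awesome-free-llm-apis');
// `options` stands for whatever configuration object you pass in.
// CommonJS has no top-level await, so the call is wrapped in an async function.
(async () => {
  const result = await awesome_free_llm_apis.run(options);
  console.log(result);
})();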
# awesome-free-llm-apis configuration
# View the configuration options
awesome-free-llm-apis --config-example > config.yml
# Common configuration keys
# output_dir: ./output
# log_level: info
# workers: 4
# Environment variable (overrides the config file)
export AWESOME_FREE_LLM_APIS_CONFIG="/path/to/config.yml"
<p align="center"> <a href="https://awesome.re"> <img src="https://awesome.re/badge-flat2.svg" alt="Awesome"> </a> </p>
<p align="center">LLM APIs with permanent free tiers for text inference.</p>
<p align="center"><sub>All endpoints are OpenAI SDK-compatible unless noted. Each link points to the provider's API key page.</sub></p>
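
Because most endpoints listed here are described as OpenAI SDK-compatible, a single request builder can in principle target any of them by swapping the base URL. The following is a minimal sketch, assuming the provider exposes the standard `/chat/completions` path under its base URL and accepts a bearer token; `buildChatRequest` and `chat` are hypothetical helper names, not part of any SDK.

```javascript
// Build a fetch() request for an OpenAI-compatible chat completions endpoint.
// buildChatRequest is a hypothetical helper, not part of any SDK.
function buildChatRequest(baseUrl, apiKey, model, messages) {
  // Strip trailing slashes so the joined URL has exactly one separator.
  const url = `${baseUrl.replace(/\/+$/, "")}/chat/completions`;
  const headers = { "Content-Type": "application/json" };
  // Some providers accept anonymous calls, so the key is optional here.
  if (apiKey) headers.Authorization = `Bearer ${apiKey}`;
  return {
    url,
    options: { method: "POST", headers, body: JSON.stringify({ model, messages }) },
  };
}

// Send one user prompt and return the first choice's text.
// Relies on the global fetch available in Node.js 18 and later.
async function chat(baseUrl, apiKey, model, prompt) {
  const { url, options } = buildChatRequest(baseUrl, apiKey, model, [
    { role: "user", content: prompt },
  ]);
  const res = await fetch(url, options);
  if (!res.ok) throw new Error(`HTTP ${res.status}: ${await res.text()}`);
  const data = await res.json();
  return data.choices[0].message.content;
}
```

Keeping request construction separate from the network call makes the URL and headers easy to inspect before any quota is spent.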
Free with NVIDIA Developer Program membership. 100+ models. Rate-limited (no daily token cap).
Base URL: https://integrate.api.nvidia.com/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| deepseek-ai/deepseek-r1 | 128K | ~163K | Text (reasoning) | ~40 RPM |
| nvidia/llama-3.1-nemotron-ultra-253b-v1 | 128K | 4K | Text | ~40 RPM |
| nvidia/nemotron-3-super-120b-a12b | 262K | 262K | Text | ~40 RPM |
| nvidia/nemotron-3-nano-30b-a3b | 128K | 32K | Text | ~40 RPM |
| meta/llama-3.1-405b-instruct | 128K | 4K | Text | ~40 RPM |
| qwen/qwen2.5-72b-instruct | 128K | 8K | Text | ~40 RPM |
| google/gemma-4-31b | 128K | 8K | Text | ~40 RPM |
| mistralai/mistral-large-2-instruct | 128K | 4K | Text | ~40 RPM |
| nvidia/nemotron-nano-2-vl | 128K | 8K | Vision + Text + Video | ~40 RPM |
| minimax/minimax-m2.7 | 128K | 8K | Text | ~40 RPM |
| + 90 more models | Varies | Varies | Text, Image, Video, Speech, Embeddings | ~40 RPM |
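
A limit of roughly 40 RPM works out to one request every 1.5 seconds if calls are spaced evenly. The sketch below shows one way to enforce that on the client side; it assumes the limit is applied per API key and is close to 40 RPM as the table suggests. `createThrottle` and `throttled` are hypothetical helpers.

```javascript
// Minimal client-side throttle: space calls evenly so that about `rpm`
// requests start per minute. `now` is injectable to make testing easy.
function createThrottle(rpm, now = Date.now) {
  const intervalMs = 60000 / rpm; // 40 RPM -> 1500 ms between requests
  let nextSlot = 0;
  // Reserve the next free slot and return how long the caller should wait (ms).
  return function reserve() {
    const t = now();
    const start = Math.max(t, nextSlot);
    nextSlot = start + intervalMs;
    return start - t;
  };
}

// Wrap any async call so that it waits for its reserved slot first.
async function throttled(reserve, fn) {
  const waitMs = reserve();
  if (waitMs > 0) await new Promise((resolve) => setTimeout(resolve, waitMs));
  return fn();
}
```

Even spacing is deliberately conservative: it gives up burst capacity in exchange for never relying on how the server counts its window.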
100K monthly Inference Provider credits for free users. Routes to Fireworks, Together, Hyperbolic, Nebius, Novita, DeepInfra and others. Thousands of models.
Base URL: https://router.huggingface.co/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Meta-Llama-3.1-8B-Instruct | 128K | ~4K | Text | Credit-metered |
| Mistral-7B-Instruct-v0.3 | 32K | ~4K | Text | Credit-metered |
| Mixtral-8x7B-Instruct-v0.1 | 32K | ~4K | Text | Credit-metered |
| Phi-3.5-mini-instruct | 128K | ~4K | Text | Credit-metered |
| Qwen2.5-7B-Instruct | 131K | ~4K | Text | Credit-metered |
| + thousands of community models | Varies | Varies | Text, Image, Audio, Embeddings | 100K credits/month free |
$1 free signup credits, no credit card required. 60+ open-source models via OpenAI-compatible API. EU-based. [^10]
Base URL: https://api.studio.nebius.com/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Meta-Llama-3.3-70B-Instruct | 128K | ~8K | Text | Tier-based |
| DeepSeek-V3-0324 | 128K | ~8K | Text | Tier-based |
| DeepSeek-R1 | 128K | ~32K | Text (reasoning) | Tier-based |
| Qwen3-235B-A22B | 128K | ~32K | Text | Tier-based |
| gpt-oss-120b | 128K | ~32K | Text | Tier-based |
| + 55 more open-source models | Varies | Varies | Text, Vision, Code, Embeddings | Tier-based |
Free tier with qualitative usage limits. 400+ models from Ollama library. Not OpenAI SDK-compatible; uses Ollama API. [^3]
Base URL: https://api.ollama.com
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| gpt-oss:120b-cloud | 128K | Model-dependent | Text | Session/weekly limits (unpublished) |
| deepseek-v3.1:671b-cloud | 128K | Model-dependent | Text | Session/weekly limits (unpublished) |
| qwen3-coder:480b-cloud | 128K | Model-dependent | Text (code) | Session/weekly limits (unpublished) |
| kimi-k2:1t-cloud | 262K | Model-dependent | Text | Session/weekly limits (unpublished) |
| glm-4.6:cloud | 128K | Model-dependent | Text | Session/weekly limits (unpublished) |
| deepseek-r1:cloud | 128K | Model-dependent | Text (reasoning) | Session/weekly limits (unpublished) |
| + 30 more cloud models | Varies | Varies | Text | Session/weekly limits (unpublished) |
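
Since this provider uses the Ollama API rather than the OpenAI format, both the request path and the response shape differ. The sketch below builds a request for Ollama's `/api/chat` endpoint; it assumes the hosted service accepts the same body as a local Ollama server and authenticates with a bearer token. `buildOllamaChatRequest` and `extractOllamaText` are hypothetical helpers.

```javascript
// Build a request for Ollama's native chat endpoint (/api/chat).
// Assumption: the hosted service takes a bearer token in the Authorization header.
function buildOllamaChatRequest(baseUrl, apiKey, model, messages) {
  return {
    url: `${baseUrl.replace(/\/+$/, "")}/api/chat`,
    options: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      // stream: false asks for a single JSON object instead of streamed chunks.
      body: JSON.stringify({ model, messages, stream: false }),
    },
  };
}

// A non-streaming Ollama chat response carries the reply under message.content,
// not under choices[0].message.content as in the OpenAI format.
function extractOllamaText(responseJson) {
  return responseJson.message.content;
}
```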
APIs run by the companies that train or fine-tune the models themselves.
$10 trial credits at signup, no credit card. Credits expire in 3 months. Covers Jamba Large and Jamba Mini.
Base URL: https://api.ai21.com/studio/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Jamba Large 1.7 | 256K | 4K | Text | 200 RPM, 10 RPS |
| Jamba Mini 2 | 256K | 4K | Text | 200 RPM, 10 RPS |
1M free tokens per Qwen model on signup, expires in 90 days (International / Singapore region). No credit card required. [^8]
Base URL: https://dashscope-intl.aliyuncs.com/compatible-mode/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Qwen3-Max | 128K | 32K | Text | Tiered by region |
| Qwen3-Plus | 1M | 32K | Text | Tiered by region |
| Qwen3-VL-Plus | 128K | 8K | Text + Vision | Tiered by region |
| Qwen3-Coder-Plus | 256K | 8K | Text (code) | Tiered by region |
| QwQ-Plus | 131K | 32K | Text (reasoning) | Tiered by region |
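
With a fixed allowance of 1M tokens per model, it can help to track consumption locally. The sketch below tallies the `usage.total_tokens` field that OpenAI-compatible responses normally include. It assumes the free quota is counted in total tokens (prompt plus completion), which the source does not confirm, so the provider's console remains the authority. `TokenBudget` is a hypothetical class, and the model string is the display name from the table, which may differ from the actual API model ID.

```javascript
// Track per-model token usage against a fixed free allowance.
class TokenBudget {
  constructor(limitPerModel) {
    this.limitPerModel = limitPerModel;
    this.used = new Map(); // model name -> tokens consumed so far
  }

  // Add the usage reported by one response and return what is left.
  record(model, responseJson) {
    const tokens = responseJson.usage?.total_tokens ?? 0;
    this.used.set(model, (this.used.get(model) ?? 0) + tokens);
    return this.remaining(model);
  }

  // Remaining tokens for a model, never below zero.
  remaining(model) {
    return Math.max(0, this.limitPerModel - (this.used.get(model) ?? 0));
  }
}

const budget = new TokenBudget(1_000_000);
const left = budget.record("Qwen3-Max", { usage: { total_tokens: 1200 } });
console.log(left); // 998800
```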
Free "Trial" API key, no credit card. 1,000 API calls/month. Non-commercial use only.
Base URL: https://api.cohere.com/v2
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Command A (111B) | 256K | 4K | Text | 20 RPM |
| Command R+ | 128K | 4K | Text | 20 RPM |
| Command R | 128K | 4K | Text | 20 RPM |
| Command R7B | 128K | 4K | Text | 20 RPM |
| Embed 4 | — | — | Embeddings (Text + Image) | 2,000 inputs/min |
| Rerank 3.5 | — | — | Reranking | 10 RPM |
5M free tokens on signup, no credit card. Credits expire 30 days after signup; pay-as-you-go after. Prompts may be used for training unless opted out. [^9]
Base URL: https://api.deepseek.com/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| deepseek-chat (V3.2) | 128K | 8K | Text | Dynamic |
| deepseek-reasoner (R1) | 128K | 8K | Text (reasoning) | Dynamic |
Free tier unavailable in EU/UK/Switzerland. Free-tier prompts may be used by Google to improve products. [^1]
Base URL: https://generativelanguage.googleapis.com/v1beta
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Gemini 2.5 Pro | 2M | 65K | Text + Image + Audio + Video | 5 RPM, 100 RPD |
| Gemini 2.5 Flash | 1M | 65K | Text + Image + Audio + Video | 10 RPM, 250 RPD |
| Gemini 2.5 Flash-Lite | 1M | 65K | Text + Image + Audio + Video | 15 RPM, 1,000 RPD |
| Gemini 3 Flash (Preview) | 1M | 65K | Text + Image + Audio + Video | Preview limits |
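
The per-minute (RPM) and per-day (RPD) limits interact: at the full per-minute rate, the daily quota is used up well before the day ends. The arithmetic below works that out for the three rows with published numbers, assuming the daily window spans 24 hours. `quotaSummary` is a hypothetical helper.

```javascript
// Published free-tier limits copied from the table above.
const limits = [
  { model: "Gemini 2.5 Pro", rpm: 5, rpd: 100 },
  { model: "Gemini 2.5 Flash", rpm: 10, rpd: 250 },
  { model: "Gemini 2.5 Flash-Lite", rpm: 15, rpd: 1000 },
];

// For one row, compute how fast the daily quota drains at the full RPM,
// and how far apart requests must be to last a whole 24-hour day.
function quotaSummary({ model, rpm, rpd }) {
  return {
    model,
    minutesAtFullRate: rpd / rpm,
    sustainableIntervalSec: 86400 / rpd,
  };
}

for (const row of limits.map(quotaSummary)) {
  console.log(
    `${row.model}: quota lasts ${row.minutesAtFullRate.toFixed(1)} min at full rate; ` +
      `one request every ${row.sustainableIntervalSec.toFixed(1)} s spreads it over 24 h`
  );
}
// First line printed:
// Gemini 2.5 Pro: quota lasts 20.0 min at full rate; one request every 864.0 s spreads it over 24 h
```

For long-running jobs the daily cap is therefore the binding constraint, not the per-minute one.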
Free "Experiment" plan, no credit card. ~1B tokens/month. Prompts may be used to improve models.
Base URL: https://api.mistral.ai/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Mistral Small 4 | 256K | 256K | Text + Image + Code | ~1 RPS, 500K TPM |
| Mistral Medium 3 | 128K | 128K | Text | ~1 RPS, 500K TPM |
| Mistral Large 3 | 256K | 256K | Text | ~1 RPS, 500K TPM |
| Mistral Nemo (12B) | 128K | 128K | Text | ~1 RPS, 500K TPM |
| Codestral | 256K | 256K | Code | ~1 RPS, 500K TPM |
| Pixtral Large | 128K | 128K | Text + Image | ~1 RPS, 500K TPM |
Permanent free models, no credit card required.
Base URL: https://open.bigmodel.cn/api/paas/v4
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| GLM-4.7-Flash | 200K | 128K | Text | 1 concurrent request |
| GLM-4.5-Flash | 128K | ~8K | Text | 1 concurrent request |
| GLM-4.6V-Flash | 128K | ~4K | Text + Image | 1 concurrent request |
10,000 Neurons/day free. 50+ models available on free tier.
Base URL: https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| @cf/meta/llama-3.3-70b-instruct-fp8-fast | 131K | Shared w/ context | Text | 10K neurons/day (shared) |
| @cf/meta/llama-3.1-8b-instruct-fp8-fast | 131K | Shared w/ context | Text | 10K neurons/day (shared) |
| @cf/meta/llama-3.2-11b-vision-instruct | 131K | Shared w/ context | Text + Vision | 10K neurons/day (shared) |
| @cf/meta/llama-4-scout-17b-16e-instruct | Up to 10M | Shared w/ context | Multimodal | 10K neurons/day (shared) |
| @cf/mistralai/mistral-small-3.1-24b-instruct | 128K | Shared w/ context | Text | 10K neurons/day (shared) |
| @cf/google/gemma-4-26b-a4b-it | 256K | Shared w/ context | Text | 10K neurons/day (shared) |
| @cf/moonshotai/kimi-k2.5 | 256K | Shared w/ context | Text + Vision | 10K neurons/day (shared) |
| @cf/deepseek-ai/deepseek-r1-distill-qwen-32b | 32K | Shared w/ context | Text (reasoning) | 10K neurons/day (shared) |
| + 42 more models | Varies | Varies | Text, Image, Audio, Embeddings | 10K neurons/day (shared) |
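
The base URL above contains an `{account_id}` placeholder, and the model name is appended as the final path segment. The sketch below fills in both; it assumes authentication with a Cloudflare API token sent as a bearer header and a `messages` array in the body. `buildWorkersAiRequest` is a hypothetical helper.

```javascript
// Base URL template as given in the listing above.
const WORKERS_AI_BASE =
  "https://api.cloudflare.com/client/v4/accounts/{account_id}/ai/run";

// Build a request for one Workers AI model.
// The model ID (for example "@cf/meta/llama-3.1-8b-instruct-fp8-fast")
// is appended to the path as-is.
function buildWorkersAiRequest(accountId, apiToken, model, messages) {
  const base = WORKERS_AI_BASE.replace("{account_id}", encodeURIComponent(accountId));
  return {
    url: `${base}/${model}`,
    options: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiToken}`,
      },
      body: JSON.stringify({ messages }),
    },
  };
}
```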
Free anonymous tier (no API key, no signup): 2 RPM per IP per model. 40+ open-weight models hosted in EU. OpenAI SDK-compatible. [^7]
Base URL: https://oai.endpoints.kepler.ai.cloud.ovh.net/v1
| Model Name | Context | Max Output | Modality | Rate Limit |
|---|---|---|---|---|
| Meta-Llama-3_3-70B-Instruct | 131K | ~4K | Text | 2 RPM (anonymous) |
| Meta-Llama-3_1-8B-Instruct | 131K | ~4K | Text | 2 RPM (anonymous) |
| DeepSeek-R1-Distill-Llama-70B | 131K | ~32K | Text (reasoning) | 2 RPM (anonymous) |
| Qwen3-32B | 131K | ~32K | Text | 2 RPM (anonymous) |
| Qwen3-Coder-30B-A3B-Instruct | 262K | ~32K | Text (code) | 2 RPM (anonymous) |
| Qwen2.5-VL-72B-Instruct | 128K | ~8K | Text + Vision | 2 RPM (anonymous) |
| Mixtral-8x7B-Instruct-v0.1 | 32K | ~4K | Text | 2 RPM (anonymous) |
| Mistral-Nemo-Instruct-2407 | 128K | ~4K | Text | 2 RPM (anonymous) |
| Qwen3Guard-Gen-8B | 32K | ~4K | Text (safety guard) | 2 RPM (anonymous) |
| Qwen3Guard-Gen-0.6B | 32K | ~4K | Text (safety guard) | 2 RPM (anonymous) |
| + 30 more models | Varies | Varies | Text, Vision, Code, Image, Speech | 2 RPM (anonymous) |
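
At 2 RPM per IP, anonymous callers should expect HTTP 429 responses and plan to wait about 30 seconds between attempts. The sketch below retries on 429 with exponential backoff. Whether this provider sends a `Retry-After` header is not stated in the source, so the code treats it as optional. `withRetry` is a hypothetical helper, and `sleep` is injectable so the logic can be exercised without real delays.

```javascript
// Retry an async request function while it keeps returning HTTP 429.
// baseDelayMs defaults to 30 s, matching one request per 30 s at 2 RPM.
async function withRetry(
  fn,
  {
    retries = 3,
    baseDelayMs = 30000,
    sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms)),
  } = {}
) {
  for (let attempt = 0; ; attempt++) {
    const res = await fn();
    // Return on any non-429 status, or once the retry budget is spent.
    if (res.status !== 429 || attempt >= retries) return res;
    // Use Retry-After (in seconds) when present; otherwise double the delay each time.
    const retryAfter = Number(res.headers?.get?.("retry-after"));
    const delayMs = retryAfter > 0 ? retryAfter * 1000 : baseDelayMs * 2 ** attempt;
    await sleep(delayMs);
  }
}
```

A call such as `withRetry(() => fetch(url, options))` returns the final response either way, so the caller still decides how to handle a 429 that survives all retries.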
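The project collects a large number of permanently free LLM APIs, making it well suited to developers who are learning about or building AI workflows. However, it has few maintainers, so there may be some issues.
AI Skill Hub is a third-party content aggregation platform. The information on this page is compiled from public data and does not constitute any legal endorsement of the tool's functionality or quality.
Validate the tool thoroughly in a sandbox or test environment before deploying it to production, and carry out any necessary security assessment.
✅ CC0 1.0: public domain dedication; copyright is fully waived, with no restrictions on use.
Overall, Open-Source AI Workflow: List of Permanent Free LLM APIs is a good-quality Agent workflow that is reasonably competitive among comparable tools. AI Skill Hub will keep tracking its updates. Consider bookmarking it and adopting it when it fits your own use case.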
| Field | Value |
|---|---|
| Original name | awesome-free-llm-apis |
| Original description | Open-source AI workflow: List of Permanent Free LLM API (API Keys). ⭐4.5k · JavaScript |
| Topics | workflow, ai-agents, anthropic, awesome, awesome-list, gemini, javascript |
| GitHub | https://github.com/mnfst/awesome-free-llm-apis |
| License | CC0-1.0 |
| Language | JavaScript |
Listed: 2026-05-22 · Updated: 2026-05-22 · License: CC0-1.0 · AI Skill Hub provides no legal endorsement of the accuracy of third-party content.