LLM工程实验室 是 AI Skill Hub 本期精选AI工具之一。综合评分 6.8 分,整体质量稳定。我们推荐使用将其纳入你的 AI 工具库,帮助提升工作效率。
LLM工程实验室 是一款基于 Jupyter Notebook 开发的开源工具,专注于 自主Agent、工作流编排、LLM工程 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
LLM工程实验室 是一款基于 Jupyter Notebook 开发的开源工具,专注于 自主Agent、工作流编排、LLM工程 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
# 克隆仓库 git clone https://github.com/Eric-LLMs/LLMs-Lab cd LLMs-Lab # 查看安装说明 cat README.md # 按 README 完成环境依赖安装后即可使用
# 查看帮助 llms-lab --help # 基本运行 llms-lab [options] <input> # 详细使用说明请查阅文档 # https://github.com/Eric-LLMs/LLMs-Lab
# llms-lab 配置说明 # 查看配置选项 llms-lab --config-example > config.yml # 常见配置项 # output_dir: ./output # log_level: info # workers: 4 # 环境变量(覆盖配置文件) export LLMS_LAB_CONFIG="/path/to/config.yml"
Full-stack LLM Engineering Lab. Features: Autonomous Agents (ReAct/AutoGPT) | Fine-Tuning Llama/Mistral (SFT/DPO) | Large Model Deployment (DeepSeek 671B / 2.5-bit) | Advanced RAG (Hybrid Search) | Function Calling (Stream/Text-to-SQL/External APIs) | Frameworks (LangChain, Semantic Kernel, OpenAI) | Daily SOTA Paper Tracking. From theory to 0-to-1.
A comprehensive interactive notebook that documents the design philosophy: - Agent Design Paradigms: Explores core concepts like Reflection, Tool Use, and Planning (referenced from Andrew Ng's framework). - System Architecture: Visualizes the Core Module Flowchart and data flow between Memory, Tools, and the Planning engine. - Use Case Demonstrations: Step-by-step walkthroughs of real-world scenarios (e.g., Sales Analysis, Automated Reporting).
- POI(Point of Interest): This demo uses Amap's (Gaode Map) public API to retrieve information about hotels, restaurants, attractions, and other points of interest (POIs) near a specific location. It allows querying nearby POIs relative to a given point.
- SQL: This demo demonstrates how Function Calling handles sophisticated database tasks and generates SQL queries.
- Stream: This demo showcases examples of Function Calling in Stream mode.
LLMs: Large Language Models Chat Models: Generally based on LLMs but restructured for conversational purposes PromptTemplate: Templates for prompt creation OutputParser: Parses the output from models
Document Loaders: Loaders for various file formats Document Transformers: Common operations on documents such as splitting, filtering, translating, and extracting metadata Text Embedding Models: Convert text into vector representations, useful for tasks like retrieval Vectorstores: Stores for vectors (used in retrieval tasks) Retrievers: Tools for retrieving vectors from storage RAG Pipline with Langchain
Memory: Not physical memory; it manages "context", "history", or "memory" from a text perspective
Chain: Implements a single function or a series of sequential functions, LangChain Expression Language (LCEL) Agent: Automatically plans and executes steps based on user input, selecting the necessary tools for each step to achieve the desired task Tools: Functions for calling external functionalities, such as Google search, file I/O, Linux shell, etc. Toolkits: A set of tools designed to operate specific software, such as a toolkit for managing databases or Gmail
- run_RAG_vector_database_pipeline: RAG Pipeline based on ChromaDB Vector Database.
The Offline Steps are as follows:
| Document Loading | Document Splitting | Vectorization | Insert into Vector Database | |-----------------------|---------------------|---------------|------------------------------| | → | → | → | → | The Online Steps are as follows: | Receive User Query | Vectorize User Query | Retrieve from Vector Database | Populate Prompt Template | Call LLM with Final Prompt | Generate Response | |-----------------------|----------------------|-------------------------------|---------------------------|----------------------------|---------------------| | → | → | → | → | → | → |
- run_RAG_ES_pipeline: RAG Pipeline based on Elasticsearch (ES).
- RAG_pipeline_pdf_table_processing: In this RAG Pipline, use data from tables in PDFs to implement RAG. Offline: Convert PDF to images and extract tables from the images → Use GPT-4 to generate textual descriptions of the table images → Store the textual descriptions (documents), their embeddings (embeddings), and image URLs (metadatas) into the vector database. Online: Receive a query and search the vector database → Retrieve table image URLs from search results (based on similarity between textual descriptions and the query) → Use GPT-4 to query and retrieve information from the table images. The pipeline flowchart is as follows: 
实验室性质的开源项目,覆盖Agent核心框架和工作流。结构清晰、代码可读性强,但维护度一般、社区活跃度低。适合学习参考而非生产使用。
该工具未明确声明开源协议,商业使用前请联系原作者确认授权范围,避免侵权风险。
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
经综合评估,LLM工程实验室 在AI工具赛道中表现稳健,质量良好。如果你已有明确的使用需求,可以直接上手体验;如果还在评估阶段,建议对比同类工具后再做决策。
| 原始名称 | LLMs-Lab |
| 原始描述 | 开源AI工作流:Full-stack LLM Engineering Lab. Features: Autonomous Agents (ReAct/AutoGPT) | Fi。⭐6 · Jupyter Notebook |
| Topics | 自主Agent工作流编排LLM工程ReAct框架AutoGPTDeepSeek-R1 |
| GitHub | https://github.com/Eric-LLMs/LLMs-Lab |
| 语言 | Jupyter Notebook |
收录时间:2026-05-23 · 更新时间:2026-05-30 · License:未公布 · AI Skill Hub 不对第三方内容的准确性作法律背书。