能力标签
AI工程从零开始
🛠
AI工具

AI工程从零开始

基于 Python · 开源 AI 工具,GitHub 社区精选
英文名:ai-engineering-from-scratch
⭐ 9.1k Stars 🍴 1.9k Forks 💻 Python 📄 MIT 🏷 AI 8.5分
8.5AI 综合评分
MCP协议AI智能体AI工程计算机视觉开源学习
✦ AI Skill Hub 推荐

经 AI Skill Hub 精选评估,AI工程从零开始 获评「强烈推荐」。已获得 9.1k 颗 GitHub Star,这款AI工具在功能完整性、社区活跃度和易用性方面表现出色,AI 评分 8.5 分,适合有一定技术背景的用户使用。

📚 深度解析

AI工程从零开始 是一款基于 Python 的开源工具,在 GitHub 上收获 9k+ Star,是MCP协议、AI智能体、AI工程、计算机视觉领域中的优质开源项目。开源工具的最大优势在于代码完全透明,你可以审计每一行代码的安全性,也可以根据自身需求进行二次开发和定制。

**为什么要使用开源工具而非商业 SaaS?**
对于个人开发者和有隐私需求的用户,本地部署的开源工具意味着数据不离本机,不受第三方服务商的数据政策约束。同时,开源工具通常没有使用次数限制和月度费用,一次安装即可长期使用,对于高频使用场景的总拥有成本(TCO)远低于订阅制商业工具。

**安装与环境准备**
AI工程从零开始 依赖 Python 运行环境。建议通过 pyenv(Python)或 nvm(Node.js)管理 Python 版本,避免全局环境污染。对于新手用户,推荐先创建虚拟环境(python -m venv venv && source venv/bin/activate),再安装依赖,这样即使出现问题也可以随时删除虚拟环境重新开始,不影响系统稳定性。

**社区与维护**
GitHub Issue 和 Discussion 是获取帮助的最快渠道。在提问前建议先检查 Closed Issues(已关闭的问题),大多数常见问题都已有解答。遇到 Bug 时,提供 pip list 的输出、完整错误堆栈和最小可复现示例,能显著提高开发者响应速度。AI Skill Hub 将持续追踪 AI工程从零开始 的版本更新,及时通知重要功能变化。

📋 工具概览

AI工程从零开始 是一款基于 Python 开发的开源工具,专注于 MCP协议、AI智能体、AI工程 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。

GitHub Stars
⭐ 9.1k
开发语言
Python
支持平台
Windows / macOS / Linux
维护状态
持续维护,定期更新
开源协议
MIT
AI 综合评分
8.5 分
工具类型
AI工具
Forks
1.9k

📖 中文文档

以下内容由 AI Skill Hub 根据项目信息自动整理,如需查看完整原始文档请访问底部「原始来源」。

AI工程从零开始 是一款基于 Python 开发的开源工具,专注于 MCP协议、AI智能体、AI工程 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。

📌 核心特色
  • 开源免费,支持本地部署,数据完全自主可控
  • 活跃的 GitHub 开源社区,持续迭代更新
  • 提供详细文档和使用示例,新手友好
  • 支持自定义配置,灵活适配不同使用环境
  • 可作为基础组件集成进现有技术栈或进行二次开发
🎯 主要使用场景
  • 本地部署运行,保护数据隐私,满足合规要求
  • 自定义集成到现有系统,扩展技术栈能力
  • 作为开源基础组件进行商业化二次开发
以下安装命令基于项目开发语言和类型自动生成,实际以官方 README 为准。
安装命令
# 方式一:pip 安装(推荐)
pip install ai-engineering-from-scratch

# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install ai-engineering-from-scratch

# 方式三:从源码安装(获取最新功能)
git clone https://github.com/rohitg00/ai-engineering-from-scratch
cd ai-engineering-from-scratch
pip install -e .

# 验证安装
python -c "import ai_engineering_from_scratch; print('安装成功')"
📋 安装步骤说明
  1. 访问 GitHub 仓库页面
  2. 按照 README 文档完成依赖安装
  3. 根据系统环境完成初始化配置
  4. 参考官方示例或文档开始使用
  5. 遇到问题可在 GitHub Issues 中查找解答
以下用法示例由 AI Skill Hub 整理,涵盖最常见的使用场景。
常用命令 / 代码示例
# 命令行使用
ai-engineering-from-scratch --help

# 基本用法
ai-engineering-from-scratch input_file -o output_file

# Python 代码中调用
import ai_engineering_from_scratch

# 示例
result = ai_engineering_from_scratch.process("input")
print(result)
以下配置示例基于典型使用场景生成,具体参数请参照官方文档调整。
配置示例
# ai-engineering-from-scratch 配置文件示例(config.yml)
app:
  name: "ai-engineering-from-scratch"
  debug: false
  log_level: "INFO"

# 运行时指定配置文件
ai-engineering-from-scratch --config config.yml

# 或通过环境变量配置
export AI_ENGINEERING_FROM_SCRATCH_API_KEY="your-key"
export AI_ENGINEERING_FROM_SCRATCH_OUTPUT_DIR="./output"
📑 README 深度解析 真实文档 完整度 31/100 含工作流图 查看 GitHub 原文 →
以下内容由系统直接从 GitHub README 解析整理,保留代码块、表格与列表结构。

简介

<p align="center"> <img src="assets/banner.svg" alt="AI Engineering from Scratch — reference manual banner" width="100%"> </p>

<p align="center"> <a href="LICENSE"><img src="https://img.shields.io/badge/license-MIT-1a1a1a?style=flat-square&labelColor=fafaf5" alt="MIT License"></a> <a href="ROADMAP.md"><img src="https://img.shields.io/badge/lessons-503-3553ff?style=flat-square&labelColor=fafaf5" alt="503 lessons"></a> <a href="#contents"><img src="https://img.shields.io/badge/phases-20-3553ff?style=flat-square&labelColor=fafaf5" alt="20 phases"></a> <a href="https://github.com/rohitg00/ai-engineering-from-scratch/stargazers"><img src="https://img.shields.io/github/stars/rohitg00/ai-engineering-from-scratch?style=flat-square&labelColor=fafaf5&color=3553ff" alt="GitHub stars"></a> <a href="https://aiengineeringfromscratch.com"><img src="https://img.shields.io/badge/web-aiengineeringfromscratch.com-3553ff?style=flat-square&labelColor=fafaf5" alt="Website"></a> </p>

Prerequisites

  • You can write code (any language; Python helps).
  • You want to understand how AI actually works, not just call APIs.

Getting started

Three ways in. Pick one.

Option A — read. Open any completed lesson on aiengineeringfromscratch.com or expand a phase under Contents. No setup, no cloning.

Option B — clone and run.

git clone https://github.com/rohitg00/ai-engineering-from-scratch.git
cd ai-engineering-from-scratch
python phases/01-math-foundations/01-linear-algebra-intuition/code/vectors.py

**Option C — find your level (recommended).** Skip ahead intelligently. Inside Claude, Cursor, Codex, OpenClaw, Hermes, or any agent with the curriculum skills installed:

/find-your-level

Ten questions. Maps your knowledge to a starting phase, builds a personalized path with hour estimates. After each phase:

```bash /check-understanding 3 # quiz yourself on phase 3 ls phases/03-deep-learning-core/05-loss-functions/outputs/

Phase 0: Setup & Tooling `12 lessons`

Get your environment ready for everything that follows.
#LessonTypeLang
01[Dev Environment](phases/00-setup-and-tooling/01-dev-environment/)BuildPython
02[Git & Collaboration](phases/00-setup-and-tooling/02-git-and-collaboration/)Learn
03[GPU Setup & Cloud](phases/00-setup-and-tooling/03-gpu-setup-and-cloud/)BuildPython
04[APIs & Keys](phases/00-setup-and-tooling/04-apis-and-keys/)BuildPython
05[Jupyter Notebooks](phases/00-setup-and-tooling/05-jupyter-notebooks/)BuildPython
06[Python Environments](phases/00-setup-and-tooling/06-python-environments/)BuildShell
07[Docker for AI](phases/00-setup-and-tooling/07-docker-for-ai/)BuildDocker
08[Editor Setup](phases/00-setup-and-tooling/08-editor-setup/)Build
09[Data Management](phases/00-setup-and-tooling/09-data-management/)BuildPython
10[Terminal & Shell](phases/00-setup-and-tooling/10-terminal-and-shell/)Learn
11[Linux for AI](phases/00-setup-and-tooling/11-linux-for-ai/)Learn
12[Debugging & Profiling](phases/00-setup-and-tooling/12-debugging-and-profiling/)BuildPython

<details id="phase-1"> <summary><b>Phase 1 — Math Foundations</b> &nbsp;<code>22 lessons</code>&nbsp; <em>The intuition behind every AI algorithm, through code.</em></summary> <br/>

#LessonTypeLang
01[Linear Algebra Intuition](phases/01-math-foundations/01-linear-algebra-intuition/)LearnPython, Julia
02[Vectors, Matrices & Operations](phases/01-math-foundations/02-vectors-matrices-operations/)BuildPython, Julia
03[Matrix Transformations & Eigenvalues](phases/01-math-foundations/03-matrix-transformations/)BuildPython, Julia
04[Calculus for ML: Derivatives & Gradients](phases/01-math-foundations/04-calculus-for-ml/)LearnPython
05[Chain Rule & Automatic Differentiation](phases/01-math-foundations/05-chain-rule-and-autodiff/)BuildPython
06[Probability & Distributions](phases/01-math-foundations/06-probability-and-distributions/)LearnPython
07[Bayes' Theorem & Statistical Thinking](phases/01-math-foundations/07-bayes-theorem/)BuildPython
08[Optimization: Gradient Descent Family](phases/01-math-foundations/08-optimization/)BuildPython
09[Information Theory: Entropy, KL Divergence](phases/01-math-foundations/09-information-theory/)LearnPython
10[Dimensionality Reduction: PCA, t-SNE, UMAP](phases/01-math-foundations/10-dimensionality-reduction/)BuildPython
11[Singular Value Decomposition](phases/01-math-foundations/11-singular-value-decomposition/)BuildPython, Julia
12[Tensor Operations](phases/01-math-foundations/12-tensor-operations/)BuildPython
13[Numerical Stability](phases/01-math-foundations/13-numerical-stability/)BuildPython
14[Norms & Distances](phases/01-math-foundations/14-norms-and-distances/)BuildPython
15[Statistics for ML](phases/01-math-foundations/15-statistics-for-ml/)BuildPython
16[Sampling Methods](phases/01-math-foundations/16-sampling-methods/)BuildPython
17[Linear Systems](phases/01-math-foundations/17-linear-systems/)BuildPython
18[Convex Optimization](phases/01-math-foundations/18-convex-optimization/)BuildPython
19[Complex Numbers for AI](phases/01-math-foundations/19-complex-numbers/)LearnPython
20[The Fourier Transform](phases/01-math-foundations/20-fourier-transform/)BuildPython
21[Graph Theory for ML](phases/01-math-foundations/21-graph-theory/)BuildPython
22[Stochastic Processes](phases/01-math-foundations/22-stochastic-processes/)LearnPython

</details>

<details id="phase-2"> <summary><b>Phase 2 — ML Fundamentals</b> &nbsp;<code>18 lessons</code>&nbsp; <em>Classical ML — still the backbone of most production AI.</em></summary> <br/>

#LessonTypeLang
01[What Is Machine Learning](phases/02-ml-fundamentals/01-what-is-machine-learning/)LearnPython
02[Linear Regression from Scratch](phases/02-ml-fundamentals/02-linear-regression/)BuildPython
03[Logistic Regression & Classification](phases/02-ml-fundamentals/03-logistic-regression/)BuildPython
04[Decision Trees & Random Forests](phases/02-ml-fundamentals/04-decision-trees/)BuildPython
05[Support Vector Machines](phases/02-ml-fundamentals/05-support-vector-machines/)BuildPython
06[KNN & Distance Metrics](phases/02-ml-fundamentals/06-knn-and-distances/)BuildPython
07[Unsupervised Learning: K-Means, DBSCAN](phases/02-ml-fundamentals/07-unsupervised-learning/)BuildPython
08[Feature Engineering & Selection](phases/02-ml-fundamentals/08-feature-engineering/)BuildPython
09[Model Evaluation: Metrics, Cross-Validation](phases/02-ml-fundamentals/09-model-evaluation/)BuildPython
10[Bias, Variance & the Learning Curve](phases/02-ml-fundamentals/10-bias-variance/)LearnPython
11[Ensemble Methods: Boosting, Bagging, Stacking](phases/02-ml-fundamentals/11-ensemble-methods/)BuildPython
12[Hyperparameter Tuning](phases/02-ml-fundamentals/12-hyperparameter-tuning/)BuildPython
13[ML Pipelines & Experiment Tracking](phases/02-ml-fundamentals/13-ml-pipelines/)BuildPython
14[Naive Bayes](phases/02-ml-fundamentals/14-naive-bayes/)BuildPython
15[Time Series Fundamentals](phases/02-ml-fundamentals/15-time-series/)BuildPython
16[Anomaly Detection](phases/02-ml-fundamentals/16-anomaly-detection/)BuildPython
17[Handling Imbalanced Data](phases/02-ml-fundamentals/17-imbalanced-data/)BuildPython
18[Feature Selection](phases/02-ml-fundamentals/18-feature-selection/)BuildPython

</details>

<details id="phase-3"> <summary><b>Phase 3 — Deep Learning Core</b> &nbsp;<code>13 lessons</code>&nbsp; <em>Neural networks from first principles. No frameworks until you build one.</em></summary> <br/>

#LessonTypeLang
01[The Perceptron: Where It All Started](phases/03-deep-learning-core/01-the-perceptron/)BuildPython
02[Multi-Layer Networks & Forward Pass](phases/03-deep-learning-core/02-multi-layer-networks/)BuildPython
03[Backpropagation from Scratch](phases/03-deep-learning-core/03-backpropagation/)BuildPython
04[Activation Functions: ReLU, Sigmoid, GELU & Why](phases/03-deep-learning-core/04-activation-functions/)BuildPython
05[Loss Functions: MSE, Cross-Entropy, Contrastive](phases/03-deep-learning-core/05-loss-functions/)BuildPython
06[Optimizers: SGD, Momentum, Adam, AdamW](phases/03-deep-learning-core/06-optimizers/)BuildPython
07[Regularization: Dropout, Weight Decay, BatchNorm](phases/03-deep-learning-core/07-regularization/)BuildPython
08[Weight Initialization & Training Stability](phases/03-deep-learning-core/08-weight-initialization/)BuildPython
09[Learning Rate Schedules & Warmup](phases/03-deep-learning-core/09-learning-rate-schedules/)BuildPython
10[Build Your Own Mini Framework](phases/03-deep-learning-core/10-mini-framework/)BuildPython
11[Introduction to PyTorch](phases/03-deep-learning-core/11-intro-to-pytorch/)BuildPython
12[Introduction to JAX](phases/03-deep-learning-core/12-intro-to-jax/)BuildPython
13[Debugging Neural Networks](phases/03-deep-learning-core/13-debugging-neural-networks/)BuildPython

</details>

<details id="phase-4"> <summary><b>Phase 4 — Computer Vision</b> &nbsp;<code>28 lessons</code>&nbsp; <em>From pixels to understanding — image, video, 3D, VLMs, and world models.</em></summary> <br/>

#LessonTypeLang
01[Image Fundamentals: Pixels, Channels, Color Spaces](phases/04-computer-vision/01-image-fundamentals/)LearnPython
02[Convolutions from Scratch](phases/04-computer-vision/02-convolutions-from-scratch/)BuildPython
03[CNNs: LeNet to ResNet](phases/04-computer-vision/03-cnns-lenet-to-resnet/)BuildPython
04[Image Classification](phases/04-computer-vision/04-image-classification/)BuildPython
05[Transfer Learning & Fine-Tuning](phases/04-computer-vision/05-transfer-learning/)BuildPython
06[Object Detection — YOLO from Scratch](phases/04-computer-vision/06-object-detection-yolo/)BuildPython
07[Semantic Segmentation — U-Net](phases/04-computer-vision/07-semantic-segmentation-unet/)BuildPython
08[Instance Segmentation — Mask R-CNN](phases/04-computer-vision/08-instance-segmentation-mask-rcnn/)BuildPython
09[Image Generation — GANs](phases/04-computer-vision/09-image-generation-gans/)BuildPython
10[Image Generation — Diffusion Models](phases/04-computer-vision/10-image-generation-diffusion/)BuildPython
11[Stable Diffusion — Architecture & Fine-Tuning](phases/04-computer-vision/11-stable-diffusion/)BuildPython
12[Video Understanding — Temporal Modeling](phases/04-computer-vision/12-video-understanding/)BuildPython
13[3D Vision: Point Clouds, NeRFs](phases/04-computer-vision/13-3d-vision-nerf/)BuildPython
14[Vision Transformers (ViT)](phases/04-computer-vision/14-vision-transformers/)BuildPython
15[Real-Time Vision: Edge Deployment](phases/04-computer-vision/15-real-time-edge/)BuildPython
16[Build a Complete Vision Pipeline](phases/04-computer-vision/16-vision-pipeline-capstone/)BuildPython
17[Self-Supervised Vision — SimCLR, DINO, MAE](phases/04-computer-vision/17-self-supervised-vision/)BuildPython
18[Open-Vocabulary Vision — CLIP](phases/04-computer-vision/18-open-vocab-clip/)BuildPython
19[OCR & Document Understanding](phases/04-computer-vision/19-ocr-document-understanding/)BuildPython
20[Image Retrieval & Metric Learning](phases/04-computer-vision/20-image-retrieval-metric/)BuildPython
21[Keypoint Detection & Pose Estimation](phases/04-computer-vision/21-keypoint-pose/)BuildPython
22[3D Gaussian Splatting from Scratch](phases/04-computer-vision/22-3d-gaussian-splatting/)BuildPython
23[Diffusion Transformers & Rectified Flow](phases/04-computer-vision/23-diffusion-transformers-rectified-flow/)BuildPython
24[SAM 3 & Open-Vocabulary Segmentation](phases/04-computer-vision/24-sam3-open-vocab-segmentation/)BuildPython
25[Vision-Language Models (ViT-MLP-LLM)](phases/04-computer-vision/25-vision-language-models/)BuildPython
26[Monocular Depth & Geometry Estimation](phases/04-computer-vision/26-monocular-depth/)BuildPython
27[Multi-Object Tracking & Video Memory](phases/04-computer-vision/27-multi-object-tracking/)BuildPython
28[World Models & Video Diffusion](phases/04-computer-vision/28-world-models-video-diffusion/)BuildPython

</details>

<details id="phase-5"> <summary><b>Phase 5 — NLP: Foundations to Advanced</b> &nbsp;<code>29 lessons</code>&nbsp; <em>Language is the interface to intelligence.</em></summary> <br/>

#LessonTypeLang
01[Text Processing: Tokenization, Stemming, Lemmatization](phases/05-nlp-foundations-to-advanced/01-text-processing/)BuildPython
02[Bag of Words, TF-IDF & Text Representation](phases/05-nlp-foundations-to-advanced/02-bag-of-words-tfidf/)BuildPython
03[Word Embeddings: Word2Vec from Scratch](phases/05-nlp-foundations-to-advanced/03-word-embeddings-word2vec/)BuildPython
04[GloVe, FastText & Subword Embeddings](phases/05-nlp-foundations-to-advanced/04-glove-fasttext-subword/)BuildPython
05[Sentiment Analysis](phases/05-nlp-foundations-to-advanced/05-sentiment-analysis/)BuildPython
06[Named Entity Recognition (NER)](phases/05-nlp-foundations-to-advanced/06-named-entity-recognition/)BuildPython
07[POS Tagging & Syntactic Parsing](phases/05-nlp-foundations-to-advanced/07-pos-tagging-parsing/)BuildPython
08[Text Classification — CNNs & RNNs for Text](phases/05-nlp-foundations-to-advanced/08-cnns-rnns-for-text/)BuildPython
09[Sequence-to-Sequence Models](phases/05-nlp-foundations-to-advanced/09-sequence-to-sequence/)BuildPython
10[Attention Mechanism — The Breakthrough](phases/05-nlp-foundations-to-advanced/10-attention-mechanism/)BuildPython
11[Machine Translation](phases/05-nlp-foundations-to-advanced/11-machine-translation/)BuildPython
12[Text Summarization](phases/05-nlp-foundations-to-advanced/12-text-summarization/)BuildPython
13[Question Answering Systems](phases/05-nlp-foundations-to-advanced/13-question-answering/)BuildPython
14[Information Retrieval & Search](phases/05-nlp-foundations-to-advanced/14-information-retrieval-search/)BuildPython
15[Topic Modeling: LDA, BERTopic](phases/05-nlp-foundations-to-advanced/15-topic-modeling/)BuildPython
16[Text Generation](phases/05-nlp-foundations-to-advanced/16-text-generation-pre-transformer/)BuildPython
17[Chatbots: Rule-Based to Neural](phases/05-nlp-foundations-to-advanced/17-chatbots-rule-to-neural/)BuildPython
18[Multilingual NLP](phases/05-nlp-foundations-to-advanced/18-multilingual-nlp/)BuildPython
19[Subword Tokenization: BPE, WordPiece, Unigram, SentencePiece](phases/05-nlp-foundations-to-advanced/19-subword-tokenization/)LearnPython
20[Structured Outputs & Constrained Decoding](phases/05-nlp-foundations-to-advanced/20-structured-outputs-constrained-decoding/)BuildPython
21[NLI & Textual Entailment](phases/05-nlp-foundations-to-advanced/21-nli-textual-entailment/)LearnPython
22[Embedding Models Deep Dive](phases/05-nlp-foundations-to-advanced/22-embedding-models-deep-dive/)LearnPython
23[Chunking Strategies for RAG](phases/05-nlp-foundations-to-advanced/23-chunking-strategies-rag/)BuildPython
24[Coreference Resolution](phases/05-nlp-foundations-to-advanced/24-coreference-resolution/)LearnPython
25[Entity Linking & Disambiguation](phases/05-nlp-foundations-to-advanced/25-entity-linking/)BuildPython
26[Relation Extraction & Knowledge Graph Construction](phases/05-nlp-foundations-to-advanced/26-relation-extraction-kg/)BuildPython
27[LLM Evaluation: RAGAS, DeepEval, G-Eval](phases/05-nlp-foundations-to-advanced/27-llm-evaluation-frameworks/)BuildPython
28[Long-Context Evaluation: NIAH, RULER, LongBench, MRCR](phases/05-nlp-foundations-to-advanced/28-long-context-evaluation/)LearnPython
29[Dialogue State Tracking](phases/05-nlp-foundations-to-advanced/29-dialogue-state-tracking/)BuildPython

</details>

<details id="phase-6"> <summary><b>Phase 6 — Speech & Audio</b> &nbsp;<code>17 lessons</code>&nbsp; <em>Hear, understand, speak.</em></summary> <br/>

#LessonTypeLang
01[Audio Fundamentals: Waveforms, Sampling, FFT](phases/06-speech-and-audio/01-audio-fundamentals)LearnPython
02[Spectrograms, Mel Scale & Audio Features](phases/06-speech-and-audio/02-spectrograms-mel-features)BuildPython
03[Audio Classification](phases/06-speech-and-audio/03-audio-classification)BuildPython
04[Speech Recognition (ASR)](phases/06-speech-and-audio/04-speech-recognition-asr)BuildPython
05[Whisper: Architecture & Fine-Tuning](phases/06-speech-and-audio/05-whisper-architecture-finetuning)BuildPython
06[Speaker Recognition & Verification](phases/06-speech-and-audio/06-speaker-recognition-verification)BuildPython
07[Text-to-Speech (TTS)](phases/06-speech-and-audio/07-text-to-speech)BuildPython
08[Voice Cloning & Voice Conversion](phases/06-speech-and-audio/08-voice-cloning-conversion)BuildPython
09[Music Generation](phases/06-speech-and-audio/09-music-generation)BuildPython
10[Audio-Language Models](phases/06-speech-and-audio/10-audio-language-models)BuildPython
11[Real-Time Audio Processing](phases/06-speech-and-audio/11-real-time-audio-processing)BuildPython
12[Build a Voice Assistant Pipeline](phases/06-speech-and-audio/12-voice-assistant-pipeline)BuildPython
13[Neural Audio Codecs — EnCodec, SNAC, Mimi, DAC](phases/06-speech-and-audio/13-neural-audio-codecs)LearnPython
14[Voice Activity Detection & Turn-Taking](phases/06-speech-and-audio/14-voice-activity-detection-turn-taking)BuildPython
15[Streaming Speech-to-Speech — Moshi, Hibiki](phases/06-speech-and-audio/15-streaming-speech-to-speech-moshi-hibiki)LearnPython
16[Voice Anti-Spoofing & Audio Watermarking](phases/06-speech-and-audio/16-anti-spoofing-audio-watermarking)BuildPython
17[Audio Evaluation — WER, MOS, MMAU, Leaderboards](phases/06-speech-and-audio/17-audio-evaluation-metrics)LearnPython

</details>

<details id="phase-7"> <summary><b>Phase 7 — Transformers Deep Dive</b> &nbsp;<code>14 lessons</code>&nbsp; <em>The architecture that changed everything.</em></summary> <br/>

#LessonTypeLang
01[Why Transformers: The Problems with RNNs](phases/07-transformers-deep-dive/01-why-transformers/)LearnPython
02[Self-Attention from Scratch](phases/07-transformers-deep-dive/02-self-attention-from-scratch/)BuildPython
03[Multi-Head Attention](phases/07-transformers-deep-dive/03-multi-head-attention/)BuildPython
04[Positional Encoding: Sinusoidal, RoPE, ALiBi](phases/07-transformers-deep-dive/04-positional-encoding/)BuildPython
05[The Full Transformer: Encoder + Decoder](phases/07-transformers-deep-dive/05-full-transformer/)BuildPython
06[BERT — Masked Language Modeling](phases/07-transformers-deep-dive/06-bert-masked-language-modeling/)BuildPython
07[GPT — Causal Language Modeling](phases/07-transformers-deep-dive/07-gpt-causal-language-modeling/)BuildPython
08[T5, BART — Encoder-Decoder Models](phases/07-transformers-deep-dive/08-t5-bart-encoder-decoder/)LearnPython
09[Vision Transformers (ViT)](phases/07-transformers-deep-dive/09-vision-transformers/)BuildPython
10[Audio Transformers — Whisper Architecture](phases/07-transformers-deep-dive/10-audio-transformers-whisper/)LearnPython
11[Mixture of Experts (MoE)](phases/07-transformers-deep-dive/11-mixture-of-experts/)BuildPython
12[KV Cache, Flash Attention & Inference Optimization](phases/07-transformers-deep-dive/12-kv-cache-flash-attention/)BuildPython
13[Scaling Laws](phases/07-transformers-deep-dive/13-scaling-laws/)LearnPython
14[Build a Transformer from Scratch](phases/07-transformers-deep-dive/14-build-a-transformer-capstone/)BuildPython
15[Attention Variants — Sliding Window, Sparse, Differential](phases/07-transformers-deep-dive/15-attention-variants/)BuildPython
16[Speculative Decoding — Draft, Verify, Repeat](phases/07-transformers-deep-dive/16-speculative-decoding/)BuildPython

</details>

<details id="phase-8"> <summary><b>Phase 8 — Generative AI</b> &nbsp;<code>14 lessons</code>&nbsp; <em>Create images, video, audio, 3D, and more.</em></summary> <br/>

#LessonTypeLang
01[Generative Models: Taxonomy & History](phases/08-generative-ai/01-generative-models-taxonomy-history/)LearnPython
02[Autoencoders & VAE](phases/08-generative-ai/02-autoencoders-vae/)BuildPython
03[GANs: Generator vs Discriminator](phases/08-generative-ai/03-gans-generator-discriminator/)BuildPython
04[Conditional GANs & Pix2Pix](phases/08-generative-ai/04-conditional-gans-pix2pix/)BuildPython
05[StyleGAN](phases/08-generative-ai/05-stylegan/)BuildPython
06[Diffusion Models — DDPM from Scratch](phases/08-generative-ai/06-diffusion-ddpm-from-scratch/)BuildPython
07[Latent Diffusion & Stable Diffusion](phases/08-generative-ai/07-latent-diffusion-stable-diffusion/)BuildPython
08[ControlNet, LoRA & Conditioning](phases/08-generative-ai/08-controlnet-lora-conditioning/)BuildPython
09[Inpainting, Outpainting & Editing](phases/08-generative-ai/09-inpainting-outpainting-editing/)BuildPython
10[Video Generation](phases/08-generative-ai/10-video-generation/)BuildPython
11[Audio Generation](phases/08-generative-ai/11-audio-generation/)BuildPython
12[3D Generation](phases/08-generative-ai/12-3d-generation/)BuildPython
13[Flow Matching & Rectified Flows](phases/08-generative-ai/13-flow-matching-rectified-flows/)BuildPython
14[Evaluation: FID, CLIP Score](phases/08-generative-ai/14-evaluation-fid-clip-score/)BuildPython
19[Visual Autoregressive Modeling (VAR): Next-Scale Prediction](phases/08-generative-ai/19-visual-autoregressive-var/)BuildPython

</details>

<details id="phase-9"> <summary><b>Phase 9 — Reinforcement Learning</b> &nbsp;<code>12 lessons</code>&nbsp; <em>The foundation of RLHF and game-playing AI.</em></summary> <br/>

#LessonTypeLang
01[MDPs, States, Actions & Rewards](phases/09-reinforcement-learning/01-mdps-states-actions-rewards/)LearnPython
02[Dynamic Programming](phases/09-reinforcement-learning/02-dynamic-programming/)BuildPython
03[Monte Carlo Methods](phases/09-reinforcement-learning/03-monte-carlo-methods/)BuildPython
04[Q-Learning, SARSA](phases/09-reinforcement-learning/04-q-learning-sarsa/)BuildPython
05[Deep Q-Networks (DQN)](phases/09-reinforcement-learning/05-dqn/)BuildPython
06[Policy Gradients — REINFORCE](phases/09-reinforcement-learning/06-policy-gradients-reinforce/)BuildPython
07[Actor-Critic — A2C, A3C](phases/09-reinforcement-learning/07-actor-critic-a2c-a3c/)BuildPython
08[PPO](phases/09-reinforcement-learning/08-ppo/)BuildPython
09[Reward Modeling & RLHF](phases/09-reinforcement-learning/09-reward-modeling-rlhf/)BuildPython
10[Multi-Agent RL](phases/09-reinforcement-learning/10-multi-agent-rl/)BuildPython
11[Sim-to-Real Transfer](phases/09-reinforcement-learning/11-sim-to-real-transfer/)BuildPython
12[RL for Games](phases/09-reinforcement-learning/12-rl-for-games/)BuildPython

</details>

<details id="phase-10"> <summary><b>Phase 10 — LLMs from Scratch</b> &nbsp;<code>22 lessons</code>&nbsp; <em>Build, train, and understand large language models.</em></summary> <br/>

#LessonTypeLang
01[Tokenizers: BPE, WordPiece, SentencePiece](phases/10-llms-from-scratch/01-tokenizers/)BuildPython, Rust
02[Building a Tokenizer from Scratch](phases/10-llms-from-scratch/02-building-a-tokenizer/)BuildPython
03[Data Pipelines for Pre-Training](phases/10-llms-from-scratch/03-data-pipelines/)BuildPython
04[Pre-Training a Mini GPT (124M)](phases/10-llms-from-scratch/04-pre-training-mini-gpt/)BuildPython
05[Distributed Training, FSDP, DeepSpeed](phases/10-llms-from-scratch/05-scaling-distributed/)BuildPython
06[Instruction Tuning — SFT](phases/10-llms-from-scratch/06-instruction-tuning-sft/)BuildPython
07[RLHF — Reward Model + PPO](phases/10-llms-from-scratch/07-rlhf/)BuildPython
08[DPO — Direct Preference Optimization](phases/10-llms-from-scratch/08-dpo/)BuildPython
09[Constitutional AI & Self-Improvement](phases/10-llms-from-scratch/09-constitutional-ai-self-improvement/)BuildPython
10[Evaluation — Benchmarks, Evals](phases/10-llms-from-scratch/10-evaluation/)BuildPython
11[Quantization: INT8, GPTQ, AWQ, GGUF](phases/10-llms-from-scratch/11-quantization/)BuildPython
12[Inference Optimization](phases/10-llms-from-scratch/12-inference-optimization/)BuildPython
13[Building a Complete LLM Pipeline](phases/10-llms-from-scratch/13-building-complete-llm-pipeline/)BuildPython
14[Open Models: Architecture Walkthroughs](phases/10-llms-from-scratch/14-open-models-architecture-walkthroughs/)LearnPython
15[Speculative Decoding and EAGLE-3](phases/10-llms-from-scratch/15-speculative-decoding-eagle3/)BuildPython
16[Differential Attention (V2)](phases/10-llms-from-scratch/16-differential-attention-v2/)BuildPython
17[Native Sparse Attention (DeepSeek NSA)](phases/10-llms-from-scratch/17-native-sparse-attention/)BuildPython
18[Multi-Token Prediction (MTP)](phases/10-llms-from-scratch/18-multi-token-prediction/)BuildPython
19[DualPipe Parallelism](phases/10-llms-from-scratch/19-dualpipe-parallelism/)LearnPython
20[DeepSeek-V3 Architecture Walkthrough](phases/10-llms-from-scratch/20-deepseek-v3-walkthrough/)LearnPython
21[Jamba — Hybrid SSM-Transformer](phases/10-llms-from-scratch/21-jamba-hybrid-ssm-transformer/)LearnPython
22[Async and Hogwild! Inference](phases/10-llms-from-scratch/22-async-hogwild-inference/)BuildPython
25[Speculative Decoding and EAGLE](phases/10-llms-from-scratch/25-speculative-decoding/)BuildPython
34[Gradient Checkpointing and Activation Recomputation](phases/10-llms-from-scratch/34-gradient-checkpointing/)BuildPython

</details>

<details id="phase-11"> <summary><b>Phase 11 — LLM Engineering</b> &nbsp;<code>17 lessons</code>&nbsp; <em>Put LLMs to work in production.</em></summary> <br/>

#LessonTypeLang
01[Prompt Engineering: Techniques & Patterns](phases/11-llm-engineering/01-prompt-engineering/)BuildPython
02[Few-Shot, CoT, Tree-of-Thought](phases/11-llm-engineering/02-few-shot-cot/)BuildPython
03[Structured Outputs](phases/11-llm-engineering/03-structured-outputs/)BuildPython
04[Embeddings & Vector Representations](phases/11-llm-engineering/04-embeddings/)BuildPython
05[Context Engineering](phases/11-llm-engineering/05-context-engineering/)BuildPython
06[RAG: Retrieval-Augmented Generation](phases/11-llm-engineering/06-rag/)BuildPython
07[Advanced RAG: Chunking, Reranking](phases/11-llm-engineering/07-advanced-rag/)BuildPython
08[Fine-Tuning with LoRA & QLoRA](phases/11-llm-engineering/08-fine-tuning-lora/)BuildPython
09[Function Calling & Tool Use](phases/11-llm-engineering/09-function-calling/)BuildPython
10[Evaluation & Testing](phases/11-llm-engineering/10-evaluation/)BuildPython
11[Caching, Rate Limiting & Cost](phases/11-llm-engineering/11-caching-cost/)BuildPython
12[Guardrails & Safety](phases/11-llm-engineering/12-guardrails/)BuildPython
13[Building a Production LLM App](phases/11-llm-engineering/13-production-app/)BuildPython
14[Model Context Protocol (MCP)](phases/11-llm-engineering/14-model-context-protocol/)BuildPython
15[Prompt Caching & Context Caching](phases/11-llm-engineering/15-prompt-caching/)BuildPython
16[LangGraph: State Machines for Agents](phases/11-llm-engineering/16-langgraph-state-machines/)BuildPython
17[Agent Framework Tradeoffs](phases/11-llm-engineering/17-agent-framework-tradeoffs/)LearnPython

</details>

<details id="phase-12"> <summary><b>Phase 12 — Multimodal AI</b> &nbsp;<code>25 lessons</code>&nbsp; <em>See, hear, read, and reason across modalities — from ViT patches to computer-use agents.</em></summary> <br/>

#LessonTypeLang
01[Vision Transformers and the Patch-Token Primitive](phases/12-multimodal-ai/01-vision-transformer-patch-tokens/)LearnPython
02[CLIP and Contrastive Vision-Language Pretraining](phases/12-multimodal-ai/02-clip-contrastive-pretraining/)BuildPython
03[BLIP-2 Q-Former as Modality Bridge](phases/12-multimodal-ai/03-blip2-qformer-bridge/)BuildPython
04[Flamingo and Gated Cross-Attention](phases/12-multimodal-ai/04-flamingo-gated-cross-attention/)LearnPython
05[LLaVA and Visual Instruction Tuning](phases/12-multimodal-ai/05-llava-visual-instruction-tuning/)BuildPython
06[Any-Resolution Vision — Patch-n'-Pack and NaFlex](phases/12-multimodal-ai/06-any-resolution-patch-n-pack/)BuildPython
07[Open-Weight VLM Recipes: What Actually Matters](phases/12-multimodal-ai/07-open-weight-vlm-recipes/)LearnPython
08[LLaVA-OneVision: Single, Multi, Video](phases/12-multimodal-ai/08-llava-onevision-single-multi-video/)BuildPython
09[Qwen-VL Family and Dynamic-FPS Video](phases/12-multimodal-ai/09-qwen-vl-family-dynamic-fps/)LearnPython
10[InternVL3 Native Multimodal Pretraining](phases/12-multimodal-ai/10-internvl3-native-multimodal/)LearnPython
11[Chameleon Early-Fusion Token-Only](phases/12-multimodal-ai/11-chameleon-early-fusion-tokens/)BuildPython
12[Emu3 Next-Token Prediction for Generation](phases/12-multimodal-ai/12-emu3-next-token-for-generation/)LearnPython
13[Transfusion Autoregressive + Diffusion](phases/12-multimodal-ai/13-transfusion-autoregressive-diffusion/)BuildPython
14[Show-o Discrete-Diffusion Unified](phases/12-multimodal-ai/14-show-o-discrete-diffusion-unified/)LearnPython
15[Janus-Pro Decoupled Encoders](phases/12-multimodal-ai/15-janus-pro-decoupled-encoders/)BuildPython
16[MIO Any-to-Any Streaming](phases/12-multimodal-ai/16-mio-any-to-any-streaming/)LearnPython
17[Video-Language Temporal Grounding](phases/12-multimodal-ai/17-video-language-temporal-grounding/)BuildPython
18[Long-Video at Million-Token Context](phases/12-multimodal-ai/18-long-video-million-token/)BuildPython
19[Audio-Language Models: Whisper to AF3](phases/12-multimodal-ai/19-audio-language-whisper-to-af3/)BuildPython
20[Omni Models: Thinker-Talker Streaming](phases/12-multimodal-ai/20-omni-models-thinker-talker/)BuildPython
21[Embodied VLAs: RT-2, OpenVLA, π0, GR00T](phases/12-multimodal-ai/21-embodied-vlas-openvla-pi0-groot/)LearnPython
22[Document and Diagram Understanding](phases/12-multimodal-ai/22-document-diagram-understanding/)BuildPython
23[ColPali Vision-Native Document RAG](phases/12-multimodal-ai/23-colpali-vision-native-rag/)BuildPython
24[Multimodal RAG and Cross-Modal Retrieval](phases/12-multimodal-ai/24-multimodal-rag-cross-modal/)BuildPython
25[Multimodal Agents and Computer-Use (Capstone)](phases/12-multimodal-ai/25-multimodal-agents-computer-use/)BuildPython

</details>

<details id="phase-13"> <summary><b>Phase 13 — Tools & Protocols</b> &nbsp;<code>23 lessons</code>&nbsp; <em>The interfaces between AI and the real world.</em></summary> <br/>

#LessonTypeLang
01[The Tool Interface](phases/13-tools-and-protocols/01-the-tool-interface/)LearnPython
02[Function Calling Deep Dive](phases/13-tools-and-protocols/02-function-calling-deep-dive/)BuildPython
03[Parallel and Streaming Tool Calls](phases/13-tools-and-protocols/03-parallel-and-streaming-tool-calls/)BuildPython
04[Structured Output](phases/13-tools-and-protocols/04-structured-output/)BuildPython
05[Tool Schema Design](phases/13-tools-and-protocols/05-tool-schema-design/)LearnPython
06[MCP Fundamentals](phases/13-tools-and-protocols/06-mcp-fundamentals/)LearnPython
07[Building an MCP Server](phases/13-tools-and-protocols/07-building-an-mcp-server/)BuildPython
08[Building an MCP Client](phases/13-tools-and-protocols/08-building-an-mcp-client/)BuildPython
09[MCP Transports](phases/13-tools-and-protocols/09-mcp-transports/)LearnPython
10[MCP Resources and Prompts](phases/13-tools-and-protocols/10-mcp-resources-and-prompts/)BuildPython
11[MCP Sampling](phases/13-tools-and-protocols/11-mcp-sampling/)BuildPython
12[MCP Roots and Elicitation](phases/13-tools-and-protocols/12-mcp-roots-and-elicitation/)BuildPython
13[MCP Async Tasks](phases/13-tools-and-protocols/13-mcp-async-tasks/)BuildPython
14[MCP Apps](phases/13-tools-and-protocols/14-mcp-apps/)BuildPython
15[MCP Security I — Tool Poisoning](phases/13-tools-and-protocols/15-mcp-security-tool-poisoning/)LearnPython
16[MCP Security II — OAuth 2.1](phases/13-tools-and-protocols/16-mcp-security-oauth-2-1/)BuildPython
17[MCP Gateways and Registries](phases/13-tools-and-protocols/17-mcp-gateways-and-registries/)LearnPython
18[MCP Auth in Production — Enrollment, JWKS Refresh, Audience Pinning](phases/13-tools-and-protocols/18-mcp-auth-production/)BuildPython
19[A2A Protocol](phases/13-tools-and-protocols/19-a2a-protocol/)BuildPython
20[OpenTelemetry GenAI](phases/13-tools-and-protocols/20-opentelemetry-genai/)BuildPython
21[LLM Routing Layer](phases/13-tools-and-protocols/21-llm-routing-layer/)LearnPython
22[Skills and Agent SDKs](phases/13-tools-and-protocols/22-skills-and-agent-sdks/)LearnPython
23[Capstone — Tool Ecosystem](phases/13-tools-and-protocols/23-capstone-tool-ecosystem/)BuildPython

</details>

<details id="phase-14"> <summary><b>Phase 14 — Agent Engineering</b> &nbsp;<code>42 lessons</code>&nbsp; <em>Build agents from first principles — loop, memory, planning, frameworks, benchmarks, production, workbench.</em></summary> <br/>

#LessonTypeLang
01[The Agent Loop](phases/14-agent-engineering/01-the-agent-loop/)BuildPython
02[ReWOO and Plan-and-Execute](phases/14-agent-engineering/02-rewoo-plan-and-execute/)BuildPython
03[Reflexion and Verbal Reinforcement Learning](phases/14-agent-engineering/03-reflexion-verbal-rl/)BuildPython
04[Tree of Thoughts and LATS](phases/14-agent-engineering/04-tree-of-thoughts-lats/)BuildPython
05[Self-Refine and CRITIC](phases/14-agent-engineering/05-self-refine-and-critic/)BuildPython
06[Tool Use and Function Calling](phases/14-agent-engineering/06-tool-use-and-function-calling/)BuildPython
07[Memory — Virtual Context and MemGPT](phases/14-agent-engineering/07-memory-virtual-context-memgpt/)BuildPython
08[Memory Blocks and Sleep-Time Compute](phases/14-agent-engineering/08-memory-blocks-sleep-time-compute/)BuildPython
09[Hybrid Memory — Mem0 Vector + Graph + KV](phases/14-agent-engineering/09-hybrid-memory-mem0/)BuildPython
10[Skill Libraries and Lifelong Learning — Voyager](phases/14-agent-engineering/10-skill-libraries-voyager/)BuildPython
11[Planning with HTN and Evolutionary Search](phases/14-agent-engineering/11-planning-htn-and-evolutionary/)BuildPython
12[Anthropic's Workflow Patterns](phases/14-agent-engineering/12-anthropic-workflow-patterns/)BuildPython
13[LangGraph — Stateful Graphs and Durable Execution](phases/14-agent-engineering/13-langgraph-stateful-graphs/)BuildPython
14[AutoGen v0.4 — Actor Model](phases/14-agent-engineering/14-autogen-actor-model/)BuildPython
15[CrewAI — Role-Based Crews and Flows](phases/14-agent-engineering/15-crewai-role-based-crews/)BuildPython
16[OpenAI Agents SDK — Handoffs, Guardrails, Tracing](phases/14-agent-engineering/16-openai-agents-sdk/)BuildPython
17[Claude Agent SDK — Subagents and Session Store](phases/14-agent-engineering/17-claude-agent-sdk/)BuildPython
18[Agno and Mastra — Production Runtimes](phases/14-agent-engineering/18-agno-and-mastra-runtimes/)LearnPython
19[Benchmarks — SWE-bench, GAIA, AgentBench](phases/14-agent-engineering/19-benchmarks-swebench-gaia/)LearnPython
20[Benchmarks — WebArena and OSWorld](phases/14-agent-engineering/20-benchmarks-webarena-osworld/)LearnPython
21[Computer Use — Claude, OpenAI CUA, Gemini](phases/14-agent-engineering/21-computer-use-agents/)BuildPython
22[Voice Agents — Pipecat and LiveKit](phases/14-agent-engineering/22-voice-agents-pipecat-livekit/)BuildPython
23[OpenTelemetry GenAI Semantic Conventions](phases/14-agent-engineering/23-otel-genai-conventions/)BuildPython
24[Agent Observability — Langfuse, Phoenix, Opik](phases/14-agent-engineering/24-agent-observability-platforms/)LearnPython
25[Multi-Agent Debate and Collaboration](phases/14-agent-engineering/25-multi-agent-debate/)BuildPython
26[Failure Modes — Why Agents Break](phases/14-agent-engineering/26-failure-modes-agentic/)BuildPython
27[Prompt Injection and the PVE Defense](phases/14-agent-engineering/27-prompt-injection-defense/)BuildPython
28[Orchestration Patterns — Supervisor, Swarm, Hierarchical](phases/14-agent-engineering/28-orchestration-patterns/)BuildPython
29[Production Runtimes — Queue, Event, Cron](phases/14-agent-engineering/29-production-runtimes/)LearnPython
30[Eval-Driven Agent Development](phases/14-agent-engineering/30-eval-driven-agent-development/)BuildPython
31[Agent Workbench: Why Capable Models Still Fail](phases/14-agent-engineering/31-agent-workbench-why-models-fail/)LearnPython
32[The Minimal Agent Workbench](phases/14-agent-engineering/32-minimal-agent-workbench/)BuildPython
33[Agent Instructions as Executable Constraints](phases/14-agent-engineering/33-instructions-as-executable-constraints/)BuildPython
34[Repo Memory and Durable State](phases/14-agent-engineering/34-repo-memory-and-state/)BuildPython
35[Initialization Scripts for Agents](phases/14-agent-engineering/35-initialization-scripts/)BuildPython
36[Scope Contracts and Task Boundaries](phases/14-agent-engineering/36-scope-contracts/)BuildPython
37[Runtime Feedback Loops](phases/14-agent-engineering/37-runtime-feedback-loops/)BuildPython
38[Verification Gates](phases/14-agent-engineering/38-verification-gates/)BuildPython
39[Reviewer Agent: Separate Builder from Marker](phases/14-agent-engineering/39-reviewer-agent/)BuildPython
40[Multi-Session Handoff](phases/14-agent-engineering/40-multi-session-handoff/)BuildPython
41[The Workbench on a Real Repo](phases/14-agent-engineering/41-workbench-for-real-repos/)BuildPython
42[Capstone: Ship a Reusable Agent Workbench Pack](phases/14-agent-engineering/42-agent-workbench-capstone/)BuildPython

Each Phase 14 workbench lesson (31-42) ships a mission.md briefing the agent before it opens the full lesson docs.

</details>

<details id="phase-15"> <summary><b>Phase 15 — Autonomous Systems</b> &nbsp;<code>22 lessons</code>&nbsp; <em>Long-horizon agents, self-improvement, and the 2026 safety stack.</em></summary> <br/>

#LessonTypeLang
01[From Chatbots to Long-Horizon Agents (METR)](phases/15-autonomous-systems/01-long-horizon-agents/)LearnPython
02[STaR, V-STaR, Quiet-STaR: Self-Taught Reasoning](phases/15-autonomous-systems/02-star-family-reasoning/)LearnPython
03[AlphaEvolve: Evolutionary Coding Agents](phases/15-autonomous-systems/03-alphaevolve-evolutionary-coding/)LearnPython
04[Darwin Gödel Machine: Self-Modifying Agents](phases/15-autonomous-systems/04-darwin-godel-machine/)LearnPython
05[AI Scientist v2: Workshop-Level Research](phases/15-autonomous-systems/05-ai-scientist-v2/)LearnPython
06[Automated Alignment Research (Anthropic AAR)](phases/15-autonomous-systems/06-automated-alignment-research/)LearnPython
07[Recursive Self-Improvement: Capability vs Alignment](phases/15-autonomous-systems/07-recursive-self-improvement/)LearnPython
08[Bounded Self-Improvement Designs](phases/15-autonomous-systems/08-bounded-self-improvement/)LearnPython
09[Autonomous Coding Agent Landscape (SWE-bench, CodeAct)](phases/15-autonomous-systems/09-coding-agent-landscape/)LearnPython
10[Claude Code Permission Modes and Auto Mode](phases/15-autonomous-systems/10-claude-code-permission-modes/)LearnPython
11[Browser Agents and Indirect Prompt Injection](phases/15-autonomous-systems/11-browser-agents/)LearnPython
12[Durable Execution for Long-Running Agents](phases/15-autonomous-systems/12-durable-execution/)LearnPython
13[Action Budgets, Iteration Caps, Cost Governors](phases/15-autonomous-systems/13-cost-governors/)LearnPython
14[Kill Switches, Circuit Breakers, Canary Tokens](phases/15-autonomous-systems/14-kill-switches-canaries/)LearnPython
15[HITL: Propose-Then-Commit](phases/15-autonomous-systems/15-propose-then-commit/)LearnPython
16[Checkpoints and Rollback](phases/15-autonomous-systems/16-checkpoints-rollback/)LearnPython
17[Constitutional AI and Rule Overrides](phases/15-autonomous-systems/17-constitutional-ai/)LearnPython
18[Llama Guard and Input/Output Classification](phases/15-autonomous-systems/18-llama-guard/)LearnPython
19[Anthropic Responsible Scaling Policy v3.0](phases/15-autonomous-systems/19-anthropic-rsp/)LearnPython
20[OpenAI Preparedness Framework and DeepMind FSF](phases/15-autonomous-systems/20-openai-preparedness-deepmind-fsf/)LearnPython
21[METR Time Horizons and External Evaluation](phases/15-autonomous-systems/21-metr-external-evaluation/)LearnPython
22[CAIS, CAISI, and Societal-Scale Risk](phases/15-autonomous-systems/22-cais-caisi-societal-risk/)LearnPython

</details>

<details id="phase-16"> <summary><b>Phase 16 — Multi-Agent & Swarms</b> &nbsp;<code>25 lessons</code>&nbsp; <em>Coordination, emergence, and collective intelligence.</em></summary> <br/>

#LessonTypeLang
01[Why Multi-Agent](phases/16-multi-agent-and-swarms/01-why-multi-agent/)LearnTypeScript
02[FIPA-ACL Heritage and Speech Acts](phases/16-multi-agent-and-swarms/02-fipa-acl-heritage/)LearnPython
03[Communication Protocols](phases/16-multi-agent-and-swarms/03-communication-protocols/)BuildTypeScript
04[The Multi-Agent Primitive Model](phases/16-multi-agent-and-swarms/04-primitive-model/)LearnPython
05[Supervisor / Orchestrator-Worker Pattern](phases/16-multi-agent-and-swarms/05-supervisor-orchestrator-pattern/)BuildPython
06[Hierarchical Architecture and Decomposition Drift](phases/16-multi-agent-and-swarms/06-hierarchical-architecture/)LearnPython
07[Society of Mind and Multi-Agent Debate](phases/16-multi-agent-and-swarms/07-society-of-mind-debate/)BuildPython
08[Role Specialization — Planner / Critic / Executor / Verifier](phases/16-multi-agent-and-swarms/08-role-specialization/)BuildPython
09[Parallel Swarm and Networked Architectures](phases/16-multi-agent-and-swarms/09-parallel-swarm-networks/)BuildPython
10[Group Chat and Speaker Selection](phases/16-multi-agent-and-swarms/10-group-chat-speaker-selection/)BuildPython
11[Handoffs and Routines (Stateless Orchestration)](phases/16-multi-agent-and-swarms/11-handoffs-and-routines/)BuildPython
12[A2A — The Agent-to-Agent Protocol](phases/16-multi-agent-and-swarms/12-a2a-protocol/)BuildPython
13[Shared Memory and Blackboard Patterns](phases/16-multi-agent-and-swarms/13-shared-memory-blackboard/)BuildPython
14[Consensus and Byzantine Fault Tolerance](phases/16-multi-agent-and-swarms/14-consensus-and-bft/)BuildPython
15[Voting, Self-Consistency, and Debate Topology](phases/16-multi-agent-and-swarms/15-voting-debate-topology/)BuildPython
16[Negotiation and Bargaining](phases/16-multi-agent-and-swarms/16-negotiation-bargaining/)BuildPython
17[Generative Agents and Emergent Simulation](phases/16-multi-agent-and-swarms/17-generative-agents-simulation/)BuildPython
18[Theory of Mind and Emergent Coordination](phases/16-multi-agent-and-swarms/18-theory-of-mind-coordination/)BuildPython
19[Swarm Optimization (PSO, ACO)](phases/16-multi-agent-and-swarms/19-swarm-optimization-pso-aco/)BuildPython
20[MARL — MADDPG, QMIX, MAPPO](phases/16-multi-agent-and-swarms/20-marl-maddpg-qmix-mappo/)LearnPython
21[Agent Economies, Token Incentives, Reputation](phases/16-multi-agent-and-swarms/21-agent-economies/)LearnPython
22[Production Scaling — Queues, Checkpoints, Durability](phases/16-multi-agent-and-swarms/22-production-scaling-queues-checkpoints/)BuildPython
23[Failure Modes — MAST, Groupthink, Monoculture](phases/16-multi-agent-and-swarms/23-failure-modes-mast-groupthink/)LearnPython
24[Evaluation and Coordination Benchmarks](phases/16-multi-agent-and-swarms/24-evaluation-coordination-benchmarks/)LearnPython
25[Case Studies and 2026 State of the Art](phases/16-multi-agent-and-swarms/25-case-studies-2026-sota/)LearnPython

</details>

<details id="phase-17"> <summary><b>Phase 17 — Infrastructure & Production</b> &nbsp;<code>28 lessons</code>&nbsp; <em>Ship AI to the real world.</em></summary> <br/>

#LessonTypeLang
01[Managed LLM Platforms — Bedrock, Azure OpenAI, Vertex AI](phases/17-infrastructure-and-production/01-managed-llm-platforms/)LearnPython
02[Inference Platform Economics — Fireworks, Together, Baseten, Modal](phases/17-infrastructure-and-production/02-inference-platform-economics/)LearnPython
03[GPU Autoscaling on Kubernetes — Karpenter, KAI Scheduler](phases/17-infrastructure-and-production/03-gpu-autoscaling-kubernetes/)LearnPython
04[vLLM Serving Internals — PagedAttention, Continuous Batching, Chunked Prefill](phases/17-infrastructure-and-production/04-vllm-serving-internals/)LearnPython
05[EAGLE-3 Speculative Decoding in Production](phases/17-infrastructure-and-production/05-eagle3-speculative-decoding/)LearnPython
06[SGLang and RadixAttention for Prefix-Heavy Workloads](phases/17-infrastructure-and-production/06-sglang-radixattention/)LearnPython
07[TensorRT-LLM on Blackwell with FP8 and NVFP4](phases/17-infrastructure-and-production/07-tensorrt-llm-blackwell/)LearnPython
08[Inference Metrics — TTFT, TPOT, ITL, Goodput, P99](phases/17-infrastructure-and-production/08-inference-metrics-goodput/)LearnPython
09[Production Quantization — AWQ, GPTQ, GGUF, FP8, NVFP4](phases/17-infrastructure-and-production/09-production-quantization/)LearnPython
10[Cold Start Mitigation for Serverless LLMs](phases/17-infrastructure-and-production/10-cold-start-mitigation/)LearnPython
11[Multi-Region LLM Serving and KV Cache Locality](phases/17-infrastructure-and-production/11-multi-region-kv-locality/)LearnPython
12[Edge Inference — ANE, Hexagon, WebGPU, Jetson](phases/17-infrastructure-and-production/12-edge-inference/)LearnPython
13[LLM Observability Stack Selection](phases/17-infrastructure-and-production/13-llm-observability/)LearnPython
14[Prompt Caching and Semantic Caching Economics](phases/17-infrastructure-and-production/14-prompt-semantic-caching/)LearnPython
15[Batch APIs — the 50% Discount as Industry Standard](phases/17-infrastructure-and-production/15-batch-apis/)LearnPython
16[Model Routing as a Cost-Reduction Primitive](phases/17-infrastructure-and-production/16-model-routing/)LearnPython
17[Disaggregated Prefill/Decode — NVIDIA Dynamo and llm-d](phases/17-infrastructure-and-production/17-disaggregated-prefill-decode/)LearnPython
18[vLLM Production Stack with LMCache KV Offloading](phases/17-infrastructure-and-production/18-vllm-production-stack-lmcache/)LearnPython
19[AI Gateways — LiteLLM, Portkey, Kong, Bifrost](phases/17-infrastructure-and-production/19-ai-gateways/)LearnPython
20[Shadow, Canary, and Progressive Deployment](phases/17-infrastructure-and-production/20-shadow-canary-progressive/)LearnPython
21[A/B Testing LLM Features — GrowthBook and Statsig](phases/17-infrastructure-and-production/21-ab-testing-llm-features/)LearnPython
| 22 | [Load Testing LLM APIs — k6, LLMPer

FIG_002 · A worked sample

Phase 14, lesson 1: the agent loop. ~120 lines of pure Python, no dependencies.

code/agent_loop.py &nbsp; <sub><i>build it</i></sub>

def run(query, tools):
    history = [user(query)]
    for step in range(MAX_STEPS):
        msg = llm(history)
        if msg.tool_calls:
            for call in msg.tool_calls:
                result = tools[call.name](**call.args)
                history.append(tool_result(call.id, result))
            continue
        return msg.content
    raise StepLimitExceeded

</td> <td valign="top" width="50%">

outputs/skill-agent-loop.md &nbsp; <sub><i>ship it</i></sub>

🎯 aiskill88 AI 点评 A 级 2026-05-21

高质量AI工程学习资���,整合MCP、智能体、视觉等核心能力,生态完整维护活跃,是AI开发者必备工具库。

📚 实用指南(长尾问题)
适合谁
  • 需要让 Claude / Cursor 操作本地工具的 AI 工程师
  • 构建多智能体协作系统的 Agent 开发者
  • 构建企业知识库 / RAG 检索应用的团队
  • 需要从图片、PDF 提取文字的文档自动化场景
  • 跨境业务、多语言内容运营团队
  • 做语音类 AI 产品的开发者
最佳实践
  • 配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC,避免暴露公网
  • 生产部署优先使用 Docker Compose 隔离依赖,并挂载 volume 持久化数据
  • 本地部署优先选 GGUF 量化模型,节省显存并保持响应速度
  • 分块大小建议 256-512 tokens,向量库优选 pgvector 或 Qdrant
  • Agent 任务先做 dry-run 验证工具调用链,再开启自主执行
常见错误
  • API key 直接提交到 git 仓库(请用 .env 并加入 .gitignore)
  • MCP 配置路径拼错或权限不足,重启 Claude Desktop 才生效
  • 容器内无法访问宿主机 localhost — 使用 host.docker.internal
  • embedding 模型与查询模型不一致导致检索失效
  • 显存不足直接 OOM — 优先降低 context 或换更小的量化模型
  • Python 依赖冲突:建议用 venv / uv 隔离环境
部署方案
  • Docker:ai-engineering-from-scratch 提供官方镜像,docker compose up 一键启动
  • CLI:直接 npm install -g / pip install,命令行调用
  • 本地部署:CPU 8GB 起,GPU 推荐 16GB+ 显存
  • 云端托管:可放在 Vercel / Railway / Fly.io 等 PaaS 平台

⚡ 核心功能

  • 开源免费,支持本地部署,数据完全自主可控
  • 活跃的 GitHub 开源社区,持续迭代更新
  • 提供详细文档和使用示例,新手友好
  • 支持自定义配置,灵活适配不同使用环境
  • 可作为基础组件集成进现有技术栈或进行二次开发
👥 适合谁
  • 需要让 Claude / Cursor 操作本地工具的 AI 工程师
  • 构建多智能体协作系统的 Agent 开发者
  • 构建企业知识库 / RAG 检索应用的团队
  • 需要从图片、PDF 提取文字的文档自动化场景
⭐ 最佳实践
  • 配置 MCP 服务器时建议使用 stdio 传输 + JSON-RPC,避免暴露公网
  • 生产部署优先使用 Docker Compose 隔离依赖,并挂载 volume 持久化数据
  • 本地部署优先选 GGUF 量化模型,节省显存并保持响应速度
  • 分块大小建议 256-512 tokens,向量库优选 pgvector 或 Qdrant
⚠️ 常见错误
  • API key 直接提交到 git 仓库(请用 .env 并加入 .gitignore)
  • MCP 配置路径拼错或权限不足,重启 Claude Desktop 才生效
  • 容器内无法访问宿主机 localhost — 使用 host.docker.internal
  • embedding 模型与查询模型不一致导致检索失效

👥 适合人群

AI 技术爱好者研究人员和学生开发者和工程师技术创业者

🎯 使用场景

  • 本地部署运行,保护数据隐私,满足合规要求
  • 自定义集成到现有系统,扩展技术栈能力
  • 作为开源基础组件进行商业化二次开发

⚖️ 优点与不足

✅ 优点
  • +GitHub 9.1k Star,社区高度认可
  • +MIT 协议,可免费商用
  • +完全开源免费,无授权费用
  • +本地部署,数据完全自主可控
  • +开发者社区支持,遇问题可查可问
⚠️ 不足
  • 安装和初始配置可能需要一定技术基础
  • 功能完整性通常不如成熟商业产品
  • 技术支持主要依赖开源社区,响应速度不稳定
⚠️ 使用须知

AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。

建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。

📄 License 说明

✅ MIT 协议 — 最宽松的开源协议之一,可自由商用、修改、分发,仅需保留版权声明。

❓ 常见问题 FAQ

ai-engineering-from-scratch 是一款Python开发的AI辅助工具。开源MCP工具:Learn it. Build it. Ship it for others.。⭐9.1k · Python 主要应用场景包括:AI智能体开发学习、MCP工具集成开发、AI工程实战教学。
💡 AI Skill Hub 点评

AI Skill Hub 点评:AI工程从零开始 的核心功能完整,质量优秀。对于AI爱好者来说,这是一个值得纳入个人工具库的选择。建议先在非生产环境试用,再逐步推广。

📚 深入学习 AI工程从零开始
查看分步骤安装教程和完整使用指南,快速上手这款工具
🌐 原始信息
原始名称 ai-engineering-from-scratch
原始描述 开源MCP工具:Learn it. Build it. Ship it for others.。⭐9.1k · Python
Topics MCP协议AI智能体AI工程计算机视觉开源学习
GitHub https://github.com/rohitg00/ai-engineering-from-scratch
License MIT
语言 Python
🔗 原始来源
🐙 GitHub 仓库  https://github.com/rohitg00/ai-engineering-from-scratch 🌐 官方网站  https://aiengineeringfromscratch.com

收录时间:2026-05-20 · 更新时间:2026-05-30 · License:MIT · AI Skill Hub 不对第三方内容的准确性作法律背书。