10 weeks of intensive learning, building, and shipping. Four sprints. One Demo Day.
Why LLMs are exciting right now; where we are in 2026; what makes now unique; course logistics; introduction to responsible AI development
Student project proposal pitches (2 min each); modern LLM stack overview; ethical considerations in project selection
Reasoning models (OpenAI o1/o3, DeepSeek R1, Claude Sonnet 4); chain-of-thought prompting; when to use reasoning vs. standard models; LLM architectures; cost/latency tradeoffs
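A minimal sketch of the reasoning-vs.-standard-model tradeoff, assuming the OpenAI Python SDK; the model names and the `reasoning_effort` parameter are illustrative, not taken from the course materials:

```python
# Sketch: chain-of-thought prompting on a standard model vs. a dedicated
# reasoning model. Model names and `reasoning_effort` are assumptions; check
# the current OpenAI docs before relying on them.
from openai import OpenAI

client = OpenAI()
question = "A train leaves at 3pm traveling 60 mph. When does it reach a city 150 miles away?"

# Standard model: reasoning is elicited explicitly in the prompt.
standard = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user",
               "content": f"{question}\nThink step by step before answering."}],
)

# Reasoning model: reasoning happens internally, paid for in latency and hidden
# reasoning tokens, so reserve it for genuinely hard problems.
reasoning = client.chat.completions.create(
    model="o3-mini",
    reasoning_effort="medium",  # assumed parameter; trades cost/latency for depth
    messages=[{"role": "user", "content": question}],
)

print(standard.choices[0].message.content)
print(reasoning.choices[0].message.content)
```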
Four pillars of building software with LLMs: Iteration, Evaluations, Deployment, Observability; building a data flywheel; continuous systems
Context engineering as evolution of prompt engineering; the full context stack (system prompts, conversation history, tool definitions, parameters); RAG architecture; managing context windows; memory systems; token budget economy; advanced prompting techniques; prompt injection attacks and defenses
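A toy RAG retrieval sketch to make the architecture concrete: embed the query, rank a small in-memory document store by cosine similarity, and inject the top hits into the prompt. The documents and question are invented, and a real system would use chunking and a vector database; this only shows the shape of the loop.

```python
# Minimal RAG sketch (illustrative, not from the lecture).
import numpy as np
from openai import OpenAI

client = OpenAI()
DOCS = ["Refunds are processed within 30 days.",
        "Shipping is free on orders over $50."]

def embed(texts):
    # Embed a batch of strings into vectors.
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

doc_vecs = embed(DOCS)

def retrieve(query: str, k: int = 1):
    # Rank documents by cosine similarity to the query and return the top k.
    q = embed([query])[0]
    sims = doc_vecs @ q / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q))
    return [DOCS[i] for i in np.argsort(-sims)[:k]]

question = "How long do refunds take?"
context = "\n".join(retrieve(question))
answer = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "system", "content": f"Answer using only:\n{context}"},
              {"role": "user", "content": question}],
)
print(answer.choices[0].message.content)
```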
Architecting the full context stack for reliable production-grade AI systems; the compiled request model (what actually goes into an LLM call); call parameters and the context window (temperature, max tokens, logit bias, token budget economy); the three strata of instructions (model system prompt, product system prompt, personas); injected knowledge (memories, RAG, user-submitted data); tools, orchestrators, and agentic loops; the context engineering checklist of five controllable levers
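An illustrative sketch of the "compiled request" idea: everything that actually reaches the model is assembled in one place. The helper name, product details, and prompt text are hypothetical; the point is that each lever (instructions, history, injected knowledge, tools, call parameters) is an explicit input rather than something scattered across the codebase.

```python
# Sketch: compiling the full context stack into a single request payload.
def compile_request(user_message, history, memories, retrieved_docs, tools):
    system_prompt = "\n\n".join([
        "You are the Acme support assistant.",            # product system prompt
        "Persona: concise, friendly, never speculate.",   # persona layer
        "Relevant user memories:\n" + "\n".join(memories),      # injected memory
        "Retrieved knowledge:\n" + "\n".join(retrieved_docs),   # injected RAG context
    ])
    return {
        "model": "gpt-4o",
        "messages": [{"role": "system", "content": system_prompt},
                     *history,                              # conversation history
                     {"role": "user", "content": user_message}],
        "tools": tools,              # tool/function definitions
        "temperature": 0.2,          # call parameters are part of the context stack
        "max_tokens": 800,           # response token budget
    }
```

The returned dict maps directly onto a chat-completions call, e.g. `client.chat.completions.create(**compile_request(...))`.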
Agent architectures (ReAct, multi-agent systems); orchestration frameworks (LangGraph, CrewAI); sequential, router, and collaborative patterns; tool use and function calling; state management; real-world agentic patterns; agent safety and containment strategies
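A minimal ReAct-style tool-calling loop, assuming the OpenAI function-calling API; the weather tool is a stand-in, and max-iteration guards and error handling are omitted. This is one common pattern, not the only agent architecture.

```python
# Sketch: single-agent loop that alternates between model calls and tool execution.
import json
from openai import OpenAI

client = OpenAI()

def get_weather(city: str) -> str:
    return f"Sunny and 20C in {city}"  # placeholder tool implementation

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

messages = [{"role": "user", "content": "What's the weather in Palo Alto?"}]
while True:
    resp = client.chat.completions.create(model="gpt-4o", messages=messages, tools=tools)
    msg = resp.choices[0].message
    if not msg.tool_calls:          # final answer reached: stop the loop
        print(msg.content)
        break
    messages.append(msg)            # record the model's tool request
    for call in msg.tool_calls:     # execute each requested tool and feed results back
        args = json.loads(call.function.arguments)
        result = get_weather(**args)
        messages.append({"role": "tool", "tool_call_id": call.id, "content": result})
```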
Groups present their progress on Sprint 1 (3 min per team)
What "AI-native" actually means vs. AI-assisted; automated consulting and spec-driven development; evaluating agents without losing your mind (179 failures analyzed); the OpenClaw phenomenon and where the industry is heading; real-world examples from Vunda AI and heynoah.io
The challenge of non-determinism in LLMs; applying the Theory of Constraints to AI systems; modeling your system (system prompt, user prompt, context, tools); identifying and fixing bottlenecks; proper evals vs. vibe evals; the improvement loop; practical examples from production AI products
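A sketch of what "proper evals" look like as opposed to vibe-checking: a fixed set of cases, a deterministic grader, and a pass rate you can track across changes to the system prompt, context, or tools. The cases and the `run_agent` stub are placeholders for your own system.

```python
# Minimal eval harness sketch (illustrative).
EVAL_CASES = [
    {"input": "Cancel my order #1234", "must_contain": "cancel"},
    {"input": "What's your refund policy?", "must_contain": "30 days"},
]

def run_agent(prompt: str) -> str:
    # Replace with a call into your actual LLM system (prompt + context + tools).
    return "stub response"

def run_evals() -> float:
    passed = 0
    for case in EVAL_CASES:
        output = run_agent(case["input"])
        if case["must_contain"].lower() in output.lower():
            passed += 1
        else:
            print(f"FAIL: {case['input']!r} -> {output[:80]!r}")
    return passed / len(EVAL_CASES)  # track this number through the improvement loop

if __name__ == "__main__":
    print(f"pass rate: {run_evals():.0%}")
```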
History of AI for code generation; benchmarking code gen (HumanEval, SWE-Bench, competitions, real-world impact); reasoning and decision-making techniques (CoT, Tree-of-Thoughts, ReAct, Reflexion, LATS); RL-driven reasoning in token space; using code generation wisely; coding agent setup (agents.md/CLAUDE.md); vibe engineering and shipping faster with coding agents; ephemeral software and environment engineering
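A hedged illustration of the kind of repository-level instruction file (agents.md / CLAUDE.md) referenced above; the sections, commands, and paths are placeholders rather than a prescribed format:

```markdown
# CLAUDE.md — instructions for coding agents working in this repo
## Setup
- Install dependencies with `pip install -e ".[dev]"` (placeholder command)
## Conventions
- Run `pytest -q` before proposing a change; never leave tests failing
- Keep functions small; business logic lives in `core/`, I/O in `adapters/`
## Out of bounds
- Do not edit `migrations/` or anything under `secrets/`
```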
Groups present their progress on Sprint 2 (3 min per team)
How to build an (almost) production-ready agent with PydanticAI; lightweight vs. heavy orchestration frameworks (PydanticAI vs. LangGraph, CrewAI, OpenAI Agents SDK); real-world application: AI for manufacturing operations at Dryft; hands-on agent implementation
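A minimal PydanticAI sketch in the spirit of this session: a typed output schema plus one tool. The model string, tool, and schema are invented, and some names vary across PydanticAI releases (`output_type`/`result.output` in newer versions vs. `result_type`/`result.data` in older ones).

```python
# Sketch: lightweight agent with a structured output and a single tool.
from pydantic import BaseModel
from pydantic_ai import Agent

class MachineStatus(BaseModel):
    machine_id: str
    needs_maintenance: bool
    reason: str

agent = Agent(
    "openai:gpt-4o",
    output_type=MachineStatus,          # result_type in older PydanticAI releases
    system_prompt="You triage manufacturing sensor reports and flag maintenance needs.",
)

@agent.tool_plain
def get_sensor_reading(machine_id: str) -> dict:
    """Return the latest sensor reading for a machine (stubbed data)."""
    return {"machine_id": machine_id, "vibration_mm_s": 9.8, "temp_c": 71}

result = agent.run_sync("Check machine A-17 and tell me if it needs maintenance.")
print(result.output)                    # result.data in older PydanticAI releases
```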
Building sustainable competitive advantage in the age of generative AI; data as a strategic moat; six sources of advantage (Berkeley Research framework); real-world success stories (ElevenLabs, Harvey AI, Synthesia); product engagement and feedback loops; the data flywheel effect; internal tools and data quality; building effective AI agents with memory layers; CS224G project challenges
Groups present their progress on Sprint 3 (3 min per team)
Native realtime voice AI with OpenAI's WebRTC integration; legacy chained pipelines (STT→LLM→TTS) vs. native realtime pipelines; ephemeral session tokens and the architecture of trust; WebRTC data channels and media tracks; function calling for voice-driven agents; backend WebSocket sideband pattern for server-side control; scaling concerns (state, reconnection, cost); hands-on escape room challenge building a voice AI app
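A server-side sketch of the ephemeral-session-token pattern: the browser never sees your real API key; it asks your backend for a short-lived client secret and uses that to open the WebRTC connection. The endpoint path, model name, and response shape here are assumptions to be confirmed against the current OpenAI Realtime docs.

```python
# Sketch: minting a short-lived token for a browser-based realtime voice client.
import os
import requests

def mint_ephemeral_token() -> str:
    resp = requests.post(
        "https://api.openai.com/v1/realtime/sessions",   # assumed endpoint
        headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
        json={"model": "gpt-4o-realtime-preview", "voice": "verse"},  # assumed fields
        timeout=10,
    )
    resp.raise_for_status()
    # Assumed response shape: the browser uses this value instead of your API key.
    return resp.json()["client_secret"]["value"]
```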
Constitutional AI and RLAIF; potential harms: bias, privacy violations, misinformation; safety techniques: red teaming, adversarial testing; regulatory landscape 2026; open source vs. closed models
Pitch techniques for technical projects; storytelling and structure for Demo Day; communicating impact and vision; handling Q&A; presentation polish and delivery
The origin story of OpenAI as YC Research; how the founding team came together; pitching and marketing AI products; growth and distribution strategies for AI startups; defensibility and competitive moats; how YC partners evaluate and qualify AI startups; predictions for where AI is headed
Thursday, March 19th, 3:30-6:30pm at CoDa (Computing and Data Science) E160
3-hour event with investors, entrepreneurs, and guests
5-minute project demonstrations per team with recorded demos
Interactive poster presentations (36" x 48") with networking
VCs, entrepreneurs, Stanford faculty, CS 224G alumni, tech press