πŸš€
First Class Jan 6, 2026
🎯
Last Class Mar 12, 2026
πŸ“
Classroom 420-040
πŸ†
Week 1
Introduction & LLM Landscape
Jan 6 - 8
TueJan 6

CS 224G Re-introduction

Why LLMs are exciting right now; where we are in 2026; what makes now unique; course logistics; introduction to responsible AI development

πŸ‘¨β€πŸ« Jan & John Core Lecture + Brainstorming
ThuJan 8

224G Project Ideas & Modern LLM Stack

Student project proposal pitches (2 min each); Modern LLM stack overview; ethical considerations in project selection

πŸ‘¨β€πŸ« John (MC) Student Pitches + Guidance
πŸ“… Project mini-pitch due
Week 2
Reasoning Models & Building Principles
Jan 13 - 15
TueJan 13

LLMs, LMMs & Reasoning Models 2026

Reasoning models (OpenAI o1/o3, DeepSeek R1, Claude Sonnet 4); chain-of-thought prompting; when to use reasoning vs. standard models; LMM architectures; cost/latency tradeoffs

πŸ‘¨β€πŸ« John Core Lecture + Hands-on API
πŸ“… Final Project Proposal due
ThuJan 15

AI Entrepreneurship & Building with GenAI

Four pillars of building software with LLMs: Iteration, Evaluations, Deployment, Observability; building a data flywheel; continuous systems

πŸ‘¨β€πŸ« Arsh Shah Dilbagi + Jan & John Guest Lecture + Discussion
πŸ“… Final Project Team due
Week 3
Context Engineering & Data Strategy
Jan 20 - 22
ℹ️ No class Monday, January 19 - MLK Day
TueJan 20

Context Engineering & RAG

Context engineering as evolution of prompt engineering; the full context stack (system prompts, conversation history, tool definitions, parameters); RAG architecture; managing context windows; memory systems; token budget economy; advanced prompting techniques; prompt injection attacks and defenses

πŸ‘¨β€πŸ« Jan Core Lecture + Exercises
πŸƒ Sprint 1 Start
ThuJan 22

More Context Engineering

Architecting the full context stack for reliable production-grade AI systems; the compiled request model (what actually goes into an LLM call); call parameters and the context window (temperature, max tokens, logit bias, token budget economy); the three strata of instructions (model system prompt, product system prompt, personas); injected knowledge (memories, RAG, user-submitted data); tools, orchestrators, and agentic loops; the context engineering checklist of five controllable levers

πŸ‘¨β€πŸ« John Core Lecture + Exercises
Week 4
Scalability & Agentic Patterns
Jan 27 - 29
TueJan 27

Agentic Workflows & Design Patterns

Agent architectures (ReAct, multi-agent systems); orchestration frameworks (LangGraph, CrewAI); sequential, router, and collaborative patterns; tool use and function calling; state management; real-world agentic patterns; agent safety and containment strategies

πŸ‘¨β€πŸ« Rakshit Agrawal, Tran Le & Sohit Gatiganti Guest Lecture + Examples
ThuJan 29

Sprint 1 Presentations

Groups present their progress on Sprint 1 (3 min per team)

πŸ‘¨β€πŸ« John (MC) Student Demos + Feedback
🏁 Sprint 1 End
Week 5
Production Systems & Evaluation
Feb 3 - 5
TueFeb 3

Building AI-Native Companies

What "AI-native" actually means vs. AI-assisted; automated consulting and spec-driven development; evaluating agents without losing your mind (179 failures analyzed); the OpenClaw phenomenon and where the industry is heading; real-world examples from Vunda AI and heynoah.io

πŸ‘¨β€πŸ« Ryan Brandt (Vunda AI) Guest Lecture + Case Studies
πŸƒ Sprint 2 Start
ThuFeb 5

Building Good LLM Products with the Theory of Constraints

The challenge of non-determinism in LLMs; applying the Theory of Constraints to AI systems; modeling your system (system prompt, user prompt, context, tools); identifying and fixing bottlenecks; proper evals vs. vibe evals; the improvement loop; practical examples from production AI products

πŸ‘¨β€πŸ« Andy Bromberg Guest Lecture
Week 6
Code Generation & Advanced Agents
Feb 10 - 12
ℹ️ Note: Tuesday, February 10 class will be in room 200-002
TueFeb 10

Code Generation with LLMs

History of AI for code generation; benchmarking code gen (HumanEval, SWE-Bench, competitions, real-world impact); reasoning and decision-making techniques (CoT, Tree-of-Thoughts, ReAct, Reflexion, LATS); RL-driven reasoning in token space; using code generation wisely; coding agent setup (agents.md/CLAUDE.md); vibe engineering and shipping faster with coding agents; ephemeral software and environment engineering

πŸ‘¨β€πŸ« Josh Payne Guest Lecture + Architectural Deep Dive
ThuFeb 12

Sprint 2 Presentations

Groups present their progress on Sprint 2 (3 min per team)

πŸ‘¨β€πŸ« John (MC) Student Demos + Technical Deep Dives
🏁 Sprint 2 End
Week 7
RAG, Memory & Orchestration
Feb 17 - 19
ℹ️ No class Monday, February 16 - Presidents' Day
⚑ Tuesday, February 17 class cancelled due to campus power outage
TueFeb 17

Class Cancelled

Cancelled due to campus power outage

πŸƒ Sprint 3 Start
ThuFeb 19

Agentic Orchestration with PydanticAI

How to build an (almost) production-ready agent with PydanticAI; lightweight vs. heavy orchestration frameworks (PydanticAI vs. LangGraph, CrewAI, OpenAI Agents SDK); real-world application: AI for manufacturing operations at Dryft; hands-on agent implementation

πŸ‘¨β€πŸ« Leonie Freisinger (Dryft) Guest Lecture
Week 8
Data Strategy, AI Safety & Ethics
Feb 24 - 26
TueFeb 24

Data Strategy & Memory Layer for AI Agents

Building sustainable competitive advantage in the age of generative AI; data as a strategic moat; 6 sources of advantage (Berkeley Research framework); real-world success stories (ElevenLabs, Harvey AI, Synthesia); product engagement and feedback loops; the data flywheel effect; internal tools and data quality; building effective AI agents with memory layers; CS224G project challenges

πŸ‘¨β€πŸ« John Core Lecture
ThuFeb 26

Sprint 3 Presentations

Groups present their progress on Sprint 3 (3 min per team)

πŸ‘¨β€πŸ« John (MC) Student Demos + Near-Final Products
🏁 Sprint 3 End
πŸ“… Safety Assessment Document due
Week 9
Realtime Voice AI & Safety
Mar 3 - 5
TueMar 3

Workshop: Building & Scaling Realtime Voice AI Apps

Native realtime voice AI with OpenAI's WebRTC integration; legacy chained pipelines (STT→LLM→TTS) vs. native realtime pipelines; ephemeral session tokens and the architecture of trust; WebRTC data channels and media tracks; function calling for voice-driven agents; backend WebSocket sideband pattern for server-side control; scaling concerns (state, reconnection, cost); hands-on escape room challenge building a voice AI app

πŸ‘¨β€πŸ« SG Ramanan Workshop + Hands-on
πŸƒ Sprint 4 Start
ThuMar 5

AI Safety, Ethics & Guardrails

Constitutional AI and RLAIF; potential harms: bias, privacy violations, misinformation; safety techniques: red teaming, adversarial testing; regulatory landscape 2026; open source vs. closed models

πŸ‘¨β€πŸ« Jan & John Core Lecture + Discussion + Case Studies
Week 10
Pitching & Final Prep
Mar 10 - 12
TueMar 10

How to Deliver a Great Pitch

Pitch techniques for technical projects; storytelling and structure for demo day; communicating impact and vision; handling Q&A; presentation polish and delivery

πŸ‘¨β€πŸ« Patricia Liu Guest Lecture
ThuMar 12

OpenAI's Beginnings & How Investors Qualify Startups

The origin story of OpenAI as YC Research; how the founding team came together; pitching and marketing AI products; growth and distribution strategies for AI startups; defensibility and competitive moats; how YC partners evaluate and qualify AI startups; predictions for where AI is headed

πŸ‘¨β€πŸ« Dalton Caldwell (Standard Capital) Guest Lecture
🏁 Sprint 4 End
Demo Day
πŸ† Demo Day & Poster Session
Mar 19, 2026

Demo Day & Poster Session

Thursday, March 19th, 3:30-6:30pm at CoDa (Computing and Data Science) E160

3-hour event with investors, entrepreneurs, and guests

🎀 Pitch Session

5-minute project demonstrations per team with recorded demos

πŸ“Š Poster Session

Interactive poster presentations (36" x 48") with networking

πŸ‘₯ Attendees

VCs, entrepreneurs, Stanford faculty, CS 224G alumni, tech press

πŸ“…
Final Deliverable Deadline Thursday, March 19, 2026 Course Reflection Report + Code + Documentation + Poster + Presentation