Schedule & syllabus

The lecture slides, labs, and assignments will be posted here as the course progresses.
Lecture times are 3pm-4:20pm PST on Tuesdays and Thursdays. All deadlines are at 11:59pm PST.

This schedule is subject to change according to the pace of the class.

Date Description Materials Events
Week 1
Tue Sept 24 Course overview and logistics
Introduction to trustworthiness of LLMs
Project overview, with examples of projects from last year
Slides
References:
TrustLLM: Trustworthiness in Large Language Models (Sections 2.2 and 3)




Thu Sept 26 Overview and structure of LLM tech stack
RAG architecture: Tools needed for RAG (LlamaIndex, TruLens); Evaluation of RAGs,
Presentation of Homework 1
Fine tuning concept and tools, anticipating Homework 2
Slides
Homework 1 Out
Homework 1 Colab
Due Oct 8th
Supplemental Materials:
LlamaIndex
TruLens
Week 2
Tues Oct 1 Evaluation of models and apps:
    Truthfulness, Safety and alignment, Bias and fairness, Robustness and security, Privacy, unlearning, and copyright implications, Calibration and confidence, Transparency and causal interventions
Slides
References:
Grounding and Evaluation for Large Language Models
Thu Oct 3 Project directions
Sample application areas & evaluations
Slides
Final project group formations due Friday, October 4th.
More info available on Ed.
Week 3: Project Proposals and Feedback
Tue Oct 8 Project Proposal Presentations
Wed Oct 9 Homework 1 Due
Thu Oct 10 Project Proposal Presentations
Week 4
Tues Oct 15 Grounding and Factuality 1
Focus: Generating grounded responses
    RAGs, Alignment
    Fine tuning (for factuality)
Homework 2 Introduction
References:
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey) Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
TRUE: Re-evaluating Factual Consistency Evaluation
Do Language Models Know When They're Hallucinating References?
RARR: Researching and Revising What Language Models Say, Using Language Models
The Internal State of an LLM Knows When its Lying
SELFCHECKGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Measuring Reliability of Large Language Models through Semantic Consistency
Homework 2 Out Due Date TBD
Thurs Oct 17 Guest Lecture:
    Nitish Joshi (NYU)
Grounding and Factuality 2
Focus: verification and guardrails
    Checking Grounding
    Checking Factuality
    Rewriting
References:
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
Week 5
Tue Oct 22 Confidence, Calibration, Uncertainty
    Chelsea Finn’s work on Calibration
    Yarin Gal’s work on Uncertainty
    Self-Consistency, GD-Consistency, Prompt-Consistency and other topics
References:
Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models
Thu Oct 24 Explainability
Data Quality for Supervised Fine-Tuning (SFT), RL(HF, AIF)
References:
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
What makes good data for alignment?
Week 6
Tue Oct 29 Agents 1
References:
Cortex Analyst – An Agentic App for text2sql
Thu Oct 31 Agents 2: Evaluation
References:
Berkeley Function Calling Leaderboard
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Week 7: Mid-term Project Presentations with Feedback
Tues Nov 5 No Class (Democracy Day)
Thurs Nov 7 Project mid-term presentations (All groups)
Week 8
Tue Nov 12 Guest Lecture:
    Omar Khattab (Stanford University)
    Mert Yuksekgonul (Stanford University)
Agents 3: End-to to-end app optimization
    DSPy (Omar Khattab)
    TextGrad (Mert Yuksekgonul)
Thu Nov 14 Guest Lecture:
    Deepak Ramachandran (Google)
Multi-modality
Week 9: Project Presentations (dry-runs, with feedback)
Tue Nov 19 Project Presentations
Thu Nov 21 Project Presentations
Thanksgiving Break (Nov 26, Nov 28)
Week 10: Final Project Fair and Presentations
Tue Dec 3 OR Thu Dec 5, 3 - 5:30PM Final Project Fair
  • ONE 2.5 hour meeting either Tues or Thurs from 3PM to 5:30PM
  • Each team gives 2-3 min presentation, followed by group poster/demo
  • Guests are welcome!
Finals Week: Final Project Report Due at the end of the scheduled exam time, which is on Thursday, December 12 at 3:15 PM.