| Event | Date | Description | Course Materials |
| Lecture |
Jan 7 |
Introduction to Reinforcement Learning |
- [Slides, Draft lecture notes]
- Additional Materials:
|
| Lecture |
Jan 9 |
How to act given know how the world works.
- Tabular setting
- Markov processes
- Policy search
- Policy iteration
- Value iteration
|
- [Slides, Draft lecture notes]
- Additional Materials:
- SB (Sutton and Barton) Chp 3, 4.1-4.4
|
| A1 |
Jan 9 |
Assignment 1 released |
Assignment 1
|
| Lecture |
Jan 14 |
Learning to evaluate a policy when don't know how the world works. |
- [Slides, Class slides with annotations, Draft lecture notes]
- Additional Materials:
- SB (Sutton and Barton) Chp 5.1, 5.5, 6.1-6.3
- David Silver's Lecture 4 [link]
|
| Lecture |
Jan 16 |
Model-free learning to make good decisions.
|
- [Slides, Class slides with annotations, Draft lecture notes]
- Additional Materials:
- SB (Sutton and Barton) Chp 5.2, 5.4, 6.4-6.5, 6.7
- Week 2 Session: [video], [ slides]
|
|
Jan 21 |
No Class |
|
| Lecture |
Jan 23 |
Scaling up: RL with function approximation |
- [Slides, Class slides with annotations, Draft lecture notes]
- Additional Materials:
|
| A1 |
Jan 23 |
Assignment 1 due, 11:59pm |
|
| A2 |
Jan 23 |
Assignment 2 released |
Assignment 2
|
| Lecture |
Jan 28 |
RL with function approximation.
|
- [Slides, Class slides with annotations, Draft lecture notes]
- Additional Materials:
|
| Lecture |
Jan 30 |
Imitation learning in large spaces |
- [Draft slides, Class slides with annotations, Draft lecture notes]
- Additional Materials:
|
| Lecture |
Feb 4 |
Policy search |
- [Draft slides, Class slides with annotations, Draft lecture notes]
- Sutton and Barto Chp 13
|
| Project |
Feb 4 |
Project proposal due, 11:59pm |
|
| Lecture |
Feb 6 |
Policy search |
- [Draft slides, Class slides, Draft lecture notes]
- Additional Materials
- Sutton and Barto Chp 13
- Week 5 Session: [video], [ slides]
|
| Project |
Feb 6 |
Assignment 2 due, 11:59pm |
|
| Lecture |
Feb 11 |
Midterm review |
- [Slides, Draft lecture notes]
- [Midterm Review]
- Week 6 Session: [video], [ slides]
|
| Exam |
Feb 13 |
In-class Midterm |
|
| A3 |
Feb 13 |
Assignment 3 released |
|
Lecture |
Feb 18 |
No Class: President's Day Holiday |
|
| Lecture |
Feb 20 |
Exploration/Exploitation |
- [Class slides with annotations, Draft lecture notes]
- Additional Materials:
|
| Lecture |
Feb 25 |
Exploration / Exploitation |
- [Class slides with annotations, Draft lecture notes, Sutton and Barto Sections 2.1-2.7]
- Additional Materials:
|
| A3 |
Feb 25 |
Project Milestone 3 due, 11:59pm |
|
| Lecture |
Feb 27 |
Exploration / Exploitation |
- [Class slides with annotations, Draft lecture notes]
- Supplementary Materials:
|
| Project |
Feb 27 |
Assignment 3 due, 11:59pm |
|
| Lecture |
Mar 4 |
Meta-Learning (Chelsea Finn guest lecture) |
|
| Lecture |
Mar 6 |
Batch Reinforcement Learning |
- [Draft Slides, Class slides with annotations, Draft lecture notes]
|
| Exam |
Mar 11 |
In-class Quiz |
|
| Lecture |
Mar 13 |
Monte Carlo Tree Search |
- [Draft Slides, Class slides with annotations]
|
| Project |
Mar 20 |
Project final paper due, 11:59pm |
|
| Project |
Mar 22 |
Poster Session 8:30 - 11:30am |
ACSR Basketball court 1 and 2
|