$\DeclareMathOperator{\p}{Pr}$ $\DeclareMathOperator{\P}{Pr}$ $\DeclareMathOperator{\c}{^C}$ $\DeclareMathOperator{\or}{ or}$ $\DeclareMathOperator{\and}{ and}$ $\DeclareMathOperator{\var}{Var}$ $\DeclareMathOperator{\E}{E}$ $\DeclareMathOperator{\std}{Std}$ $\DeclareMathOperator{\Ber}{Bern}$ $\DeclareMathOperator{\Bin}{Bin}$ $\DeclareMathOperator{\Poi}{Poi}$ $\DeclareMathOperator{\Uni}{Uni}$ $\DeclareMathOperator{\Exp}{Exp}$ $\DeclareMathOperator{\N}{N}$ $\DeclareMathOperator{\R}{\mathbb{R}}$ $\newcommand{\d}{\, d}$

Syllabus

Updated 2023092206

If you have any questions after reading this Syllabus, post on our discussion forum.

Teaching Team

Chris Piech

Professor: Chris Piech
piech @ cs
Durand 305


We are lucky to have a phenomenal group of Course Assistants:

Kelly Cochran
Kelly Cochran
kcochran @
Head TA
Kathleen Cheng
Kathleen Cheng
kcheng18 @
TA
Jacob Mejia
Jacob Mejia
jamejia @
TA
Noor Fakih
Noor Fakih
nfakih @
TA
Matthew Harvill
Matthew Harvill
mharvill @
TA
Ryan Kang
Ryan Kang
txryank @
TA
Sierra Wang
Sierra Wang
sierraw @
TA
Juliette Woodrow
Juliette Woodrow
jwoodrow @
TA
Isabel Michel
Isabel Michel
imichel @
TA
Nikil Ravi
Nikil Ravi
nravi @
TA
Oumnia Chellah
Oumnia Chellah
oumnia @
TA
Minh Vu
Minh Vu
minhavu @
TA
Tori Qiu
Tori Qiu
toriqiu @
TA
Justin Tinker
Justin Tinker
jatinker @
TA
Hamza El Boudali
Hamza El Boudali
hamza410 @
TA
Michael Xue
Michael Xue
mikexue7 @
TA
Raj Palleti
Raj Palleti
rpalleti @
TA
Koyet Alagbe
Koyet Alagbe
kalagbe @
TA
Abi Lopez
Abi Lopez
abilopez @
TA

I. Course Overview

While the initial foundations of computer science began in the world of discrete mathematics (after all, modern computers are digital in nature), recent years have seen a surge in the use of probability as a tool for the analysis and development of new algorithms and systems. As a result, it is becoming increasingly important for budding computer scientists to understand probability theory, both to provide new perspectives on existing ideas and to help further advance the field in new ways.

CS109: Probability for Computer Scientists starts by providing a fundamental grounding in combinatorics, and then quickly moves into the basics of probability theory. We will then cover many essential concepts in probability theory, including particular probability distributions, properties of probabilities, and mathematical tools for analyzing probabilities. Finally, the last third of the class will focus on data analysis and machine learning as a means for seeing direct applications of probability in this exciting and quickly growing subfield of computer science. This is going to be a great quarter and we are looking forward to the chance to teach you.

Learning Goals

Our goal in CS109 is to build foundational skills and give you experience in the following areas:

  1. Understanding the combinatorial nature of problems: Many real problems are based on understanding the multitude of possible outcomes that may occur, and determining which of those outcomes satisfy some criteria we care about. Such understanding is important both for determining how likely an outcome is, but also for understanding what factors may affect the outcome (and which of those may be in our control).
  2. Working knowledge of probability theory: Having a solid knowledge of probability theory is essential for computer scientists today. Such knowledge includes theoretical fundamentals as well as an appreciation for how that theory can be successfully applied in practice. We hope to impart both these concepts in this class.
  3. Appreciation for probabilistic statements: In the world around us, probabilistic statements are often made, but are easily misunderstood. For example, when a candidate in an election is said to have a 53% likelihood of winning does this mean that the candidate is likely to get 53% of the vote, or that that if 100 elections were held today, the candidate would win 53% of them? Understanding the difference between these statements requires an understanding of the model in the underlying probabilistic analysis.
  4. Applications: We are not studying probability theory simply for the joy of drawing summation symbols (okay, maybe some people are, but that's not what we're really targeting in this class), but rather because there are a wide variety of applications where probability allows us to solve problems that might otherwise be out of reach (or would be solved more poorly without the tools that probability can bring to bear). We'll look at examples of such applications throughout the class.
  5. An introduction to machine learning: Machine learning is a quickly growing subfield of artificial intelligence which has grown to impact many applications in computing. It focuses on analyzing large quantities of data to build models that can then be harnessed in real problems, such as filtering email, improving web search, understanding computer system performance, predicting financial markets, or analyzing DNA.

Course Topics

Here are the broad strokes of the course (in approximate order). More information is available on our Schedule page. We cover a very broad set of topics so that you are equipped with the probability and statistics you will see in your future CS studies!

  • Counting and probability fundamentals
  • Single-dimensional random variables
  • Probabilistic models
  • Uncertainty theory
  • Parameter estimation
  • Introduction to machine learning

Prerequisites

The prerequisites for this course are CS103, CS106B or X, and Math 51 (or equivalent courses). Probability involves a fair bit of mathematics (set theory, calculus, and familiarity with linear algebra), and we'll be considering several applications of probability in CS that require familiarity with algorithms and data structures covered in CS106B/X. Here is a quick rundown of some of the mathematical tools from CS103 and Math 51 that we'll be using in this class: multivariate calculus (integration and differentiation), linear algebra (basic operations on vectors and matrices), an understanding of the basics of set theory (subsets, complements, unions, intersections, cardinality, etc.), and familiarity with basic proof techniques. We'll also do combinatorics in the class, but we'll be covering a fair bit of that material ourselves in the first week. Past students have managed to take CS106B concurrently with CS109 and have done just fine. CS103 is the pre-requisite that we rely on the least. Students have done well even without having taken CS103.

II. Course Structure

Lectures

Lectures are MWF from 3:30p until 4:50p. We will be holding live lectures in-person in Hewlett 200 Auditorium. Come to learn the material, engage in interesting problems collectively with the class. While lecture attendance isn't mandatory it is correlated with doing well in the course and mastering the material. Lecture is a good place to learn, make friends and enjoy probability. Good times

Lecture Recordings

This term CS109 is not recorded. Each year the powers that be (a group called SCPD) decide which classes will be recorded. This year CS109 will only be recorded in the winter and spring terms, not the fall. So it goes! If recording is necessary for you to participate in lecture, I recommend taking CS109 in a later quarter.

Is it a "bad thing" that lectures are not recorded? I think not! Did you know that on quarters where we are not recorded, students perform better in CS109? We did an analysis of past courses and the lift was rather noticable both in terms of grades and ability! Why? I am guessing that coming to class is a better way to learn the material, and a better time management system than watching the content online. Considering that this is a foundational course, better mastery of the material seems like a great thing. We put a lot of love into the lectures. Come join in on the good times!

There are some really great uses of recordings (eg I got sick and missed a lecture). For such use cases we have a plan. We have collected recordings from CS109 Fall 2022, the last time Chris taught the class, and we are going to make the recordings available to you. Wahoo! This term we will cover the same content and will follow almost exactly the same schedule. There will be a single lecture (lecture 14) which should change substantially.

Units

If you are an undergraduate, you are required to take CS109 for 5 units of credit (this is by department and university policy, no exceptions). If you are a graduate student, you may enroll in CS109 for 3 or 4 units if it is necessary for you to reduce your units for administrative reasons. Taking the course for reduced units does not imply any change in the course requirements.

Sections

Active participation plays an important role in making you adept at combining probability and computer science. It has also been observed over many quarters that keeping up with the material highly correlates with improved class performance.

Each week for 50 minutes you will meet in a small group with one of our outstanding CAs (section leaders) and work through problems. If you have taken any of the CS106 classes, our sections will be very similar—except with more probability. Sign-ups for sections will go out on Thursday, Sept 28th and will be open until noon Pacific Sunday, Oct 1st. We will let you know which section you are in by Monday, Oct 2nd and you will have your first section that week (during Week 2).

Section attendance and participation is required for all students. Your participation grade is based on how much you engage with section; a student that engages and helps others will recieve a better score than one who shows up late and doesn't participate. Students are allowed two (2) makeup sections (i.e. attending another CA's session in the same week) and one (1) unexcused section absence in the quarter without penalty.

Grading

The grade for the course will be determined according to the following breakdown:

ComponentFinal grade
Problem Sets 40%
Midterm 20%
Final 30%
Participation 10%

Problem Sets

During the course, there will be six problem sets assigned. We put a lot of love into these problems so that they can help train you to become gifted practitioners of probability and computer science. Use them as practice. Doing well on the problem sets is the best way to prepare for life after CS109 (and the exams). Each student is to submit individual work on the problem sets. The problem sets will often include coding tasks, which will be primarily in python. We also strongly encourage you to learn LaTex, which is the interchangable markup language for typing math on a computer. We will hold review sessions for those of you who are not familiar with python.

This quarter in CS109 we are using "the psetapp", a webapp writen by Chris where you can solve your problem sets. Importantly, the psetapp allows you to check your work as you go! That way you can get immediate feedback as to whether you have figured out the solution. Things to know: (1) the psetapp auto saves your progress so you never need to hit a "save" button, (2) similarly, you don't need to hit submit. At the deadline you will no longer be able to edit your work and we will start grading your current saved solutions (3) You can check your answer as many times as you want. There is no penalty for retrying if you didn't get the answer correct the first time.

After you submit we will use a combination of manual and automatic grading. For every problem we expect that you show your work. For each math solution, you should provide a detailed enough explanation that someone who is fluent in CS109 would understand how to solve it. Never write down just the answer. For coding solutions, we expect you to use enough comments and good style that it is clear what process your code is using to solve the task.

After grades are released, you have one week to file a regrade request if you think that points were deducted incorrectly. We reserve the right to regrade your entire pset.

Late Policy

There may be unforeseen circumstances that make it difficult to turn in homework assignments on time. Our philosophy is to treat you as adults and thus we have a generous late policy to reflect the many different needs that may come up. However, the course will end for everyone on the same date. As such if you are late on one problem set, you will have to work extra hard to catch up. Time management can be hard and we encourage you to give it the full respect it deserves. In practice, falling behind often impacts midterm and final exam scores.

  • Due Date: The on time deadline will be listed on each assignment writeup (generally at 3p). Finishing the assignment by the deadline means that you are in sync with the course. Hooray!
  • Grace Period: All students will be automatically granted a 24 hour penalty-free "grace period" for submission on all problem sets. The grace period allows you to submit the assignment after the original deadline, with no impact on the final grade. This grace period is meant to give built-in flexibility for any unexpected snags such as being 10 mins late on finishing. The 24 hour period is enforced by a computer clock so we encourage you not to push it to the limit.
  • Long Extension: CS109 is a fast-paced class and if you need an extension beyond the grace period then you may fall behind on future problem sets (or the midterm / final). Having said that, you might have a medical, personal or serious time-management situation which requires an "as long as possible" extension. We will give you up until the date when we release solutions. This time is set on a per pset basis but generally is two class days after the problem set was due. For example, if the pset was due on Friday, typically the long extension would allow you to make changes until the following Wednesday. You can grant yourself an extension of this length via the CS109 pset app. If you use more than one of these in the quarter you will need to contact the Head TA (Kelly) for special permission. Why must you contact us? Because we care and we want to catch issues early. In the rare occurence that someone uses more than one of these extensions without getting permission it will impact their course grade.
  • After the Hard Deadline: What if you are not done by the time we release solutions? In general we do not accept work after the solutions have been posted and TAs start grading. As such you should make sure you don't accidentally miss this very hard deadline! Having said that, there may be a real crisis that means you are not able to do your work before the solutions are released (eg an illness that lasts a week, funeral attendance, etc). First, we hope you are well. Personal life is so truly important and we respect you doing what you need to do. In such an extreme case you need to contact the course staff (cs109@cs.stanford.edu) as well as your Undergraduate Advising Director and we will work something out. Please do contact us as early as possible.

Warning: PSet3 (the pset before the midterm) will not have the option of a long extension. Why? The PSet is due on Friday and the midterm is the following Monday. We release the solutions on Saturday so that students can refer to solutions while studying. Similarly PSet6 (the pset before the final) will have a long extension deadline which reflects when we need to release solutions.

Exams

In addition to the assignments, there will a midterm and a final:

  • Midterm: Monday, Oct 30th, 7pm - 9pm
  • Final: Wed, Dec 13th, 7pm - 10pm

For more details about the format of the exams, see the midterm and final pages. We provide all the material you need to study from the start of class, however we don't expect you to begin preparing until the week before the test.

Midterm Alternatives: A week before the midterm we will give more information on what you should do if you have an academic conflict with the midterm.

No Final Alternatives: For a variety of reasons (including university policy), we generally do not provide alternate offerings for the final exam. Please make sure that you can attend the final exam at the specified time before enrolling in the class. Typically, you will not be able to make the final if you are taking another class at the same time as CS109. If you cannot make the final exam, you should plan on taking CS109 in a different quarter. If you have a particularly compelling reason for missing the exam, and you contact us the first week of the term, we will look into making an exception to this generally hard set rule. If you start the course and for some reason you can't take the exam, we will offer you an incomplete and you can take the final with a future offering of CS109.

Extra Credit Challege

This quarter CS109 will hold an extra credit challege, where students apply the principles in this class to explore a topic of their own choice. Participation in the contest is completely optional and prizes involve extra credit to your final course grade. More details about the contest to be posted midway through the quarter.

III. Course Resources

CS109 Course Reader

Last year Chris thought to himself, "you know what CS109 students would probably love? A course reader!" So Chris wrote, and coded, and created a CS109 Course Reader. It is something which he is going to update we go. Please let Chris know if you are curious about something or if you find a typo. Please don't expect anything perfect. It has gone through one quarter of iterations but you might still find some small bugs. Please check out the chapters and the demos: Course Reader. This is also a collectively owned document. Want to add to it? Or fix a typo? You can contribute Course Reader Github Project.

Optional Textbook

Sheldon Ross, A First Course in Probability (10th Ed.), Pearson Prentice Hall, 2018.

This is an optional textbook, meaning that the text is not required material, but students may find Ross offers a different and useful perspective on the important concepts of the class. Suggested, optional reading assignments from the textbook (10th Ed.) are in the schedule on the course website. The 8th, 9th, and 10th editions of the textbook are all fine for this class.

Borrowing the textbook online: HathiTrust, a library archive of which Stanford is a member, has granted the university online access to the 8th edition (2010) for the duration of the Fall quarter. The "check out" system works similarly to print reserves: A user can check out the book an hour at a time as long as they are actively using it. Access guidelines are on the HathiTrust How To Use It webpage. Once you're logged in, the book is at this link.

All students should retain receipts for books and other course-related expenses, as these may be qualified educational expenses for tax purposes. If you are an undergraduate receiving financial aid, you may be eligible for additional financial aid for required books and course materials if these expenses exceed the aid amount in your award letter. For more information, review your award letter or visit the Student Budget website.

"Working" Office Hours

To help make you more successful in this class , the course staff will hold "working" office hours. The idea is to encourage you to work on your problem sets at these office hours, so you can immediately ask any questions that come up while working on your problem sets. While you are certainly not required to attend any of these working office hours, they are simply meant to encourage you to interact with the course staff more often in order to help you better understand the course material. Besides, our job is to help everyone learn the material for this class, and being more accessible to you when you are actually working on your assignments (rather than when you just have a problem) will help the course go more smoothly for you (and it'll be more fun for us).

Not all office hours are the same! The office hour calendar will indicate specific details such as the time and location of each office hours. Office hours are either in person or online. If an office hour time on the calendar says it is in person, there will be no online option — and if the calendar says an office hours session is online, please don't show up at our TAs dorms :).

There are some things that come up which only Chris can help with. For that reason his office hours are generally 1:1 and do not cover pset questions. Come talk about course concepts, applications of probability, career advice, life etc.

Accommodations

Students who may need an academic accommodation based on the impact of a disability must initiate the request with the Office of Accessible Education (OAE). Professional staff will evaluate the request with required documentation, recommend reasonable accommodations, and prepare an Accommodation Letter for faculty. For students who have disabilities that don't typically change appreciably over time, the letter from the OAE will be for the entire academic year; other letters will be for the current quarter only. Students should contact the OAE as soon as possible since timely notice (for example, at least a week before an exam) is needed to coordinate accommodations. Students should also send your accommodation letter to instructors as soon as possible. If you require additional, or different, accommodations specific to the Fall 2023 learning environment, please contact your disability adviser directly.

IV. Honor Code

Please read our full Honor Code Policy, which specifically prohibits you from soliciting or taking solutions from other students or websites like Stack Overflow and Chegg.

Looking Forward to a Great Quarter

Genuinely, teaching CS109 is a profound joy. Thanks for coming to learn with us. We can't wait 🌱.