Project in Mining Massive Data Sets
Spring 2018
CS341 is an advanced project-based course. Students work on data mining and machine learning algorithms for analyzing very large amounts of data. Course staff provide both interesting large datasets and the computational infrastructure (a large MapReduce cluster).



Jure Leskovec (jure@cs.stanford.edu)

Jeff Ullman (ullman@gmail.com)

Chris Re (chrismre@cs.stanford.edu)

Anand Rajaraman (datawocky@gmail.com)

Andreas Paepcke (paepcke@cs.stanford.edu)

Rok Sosic (rok@cs.stanford.edu)

Class meetings:

This is a project course. There will be only a few weekly lectures and only one or two introductory homework assignments. We will spend the quarter working in teams on different large-scale data mining projects. Each team will meet individually with its assigned mentor.

Teaching assistants:

Abhay Agarwal (abhayka@stanford.edu). Office Hours: click here to schedule

Wanzi Zhou (wanziz@stanford.edu). Office Hours: by appointment (via e-mail)

Course Calendar

Information Session for 2018


Mailing list: You can reach us at cs341-spr1718-staff@lists.stanford.edu
Piazza: For class-related questions and discussions, use Piazza: https://piazza.com/stanford/spring2018/cs341


The course is generously supported by Google, which gives us access to its Cloud Platform.