Project in Mining Massive Data Sets
Spring 2017
CS341 is an advanced project-based course. Students will work on data mining and machine learning algorithms for analyzing very large amounts of data. Course staff will provide both interesting large datasets and computational infrastructure (a large MapReduce cluster).


Course announcements will be posted here.

Project Ideas (Slides from the info session)


Jeff Ullman (ullman@gmail.com)

Jure Leskovec (jure@cs.stanford.edu)

Matei Zaharia (matei@cs.stanford.edu)

Chris Re (chrismre@cs.stanford.edu)

Anand Rajaraman (datawocky@gmail.com)

Andreas Paepcke (paepcke@cs.stanford.edu)

Rok Sosic (rok@cs.stanford.edu)

Class meetings:

Mondays and Wednesdays 4:30PM - 5:50PM in 380-380Y.

This is a project course. There will be only a few weekly lectures and only one or two introductory homework assignments. We will spend the quarter working in teams on different large-scale data mining projects. Each team will meet individually with its assigned mentor.

Teaching assistant:

Nihit Desai (nihit@stanford.edu)
Office Hours: TBD


You can reach us at cs341-spr1617-staff@lists.stanford.edu
Details about Piazza will be posted here when the class begins.

Previous versions of the course:

CS341: Spring 2013

CS341: Spring 2012

CS341: Spring 2011


The course is generously supported by Amazon, which gives us access to its EC2 platform.