Project in Mining Massive Data Sets
Spring 2016
CS341 is an advanced project-based course. Students will work on data mining and machine learning algorithms for analyzing very large amounts of data. Course staff will provide both interesting big datasets and computational infrastructure (a large MapReduce cluster).



Chris Re (chrismre@cs.stanford.edu)

Anand Rajaraman (datawocky@gmail.com)

Jeff Ullman (ullman@gmail.com)

Andreas Paepcke (paepcke@cs.Stanford.edu)

Rok Sosic (rok@cs.stanford.edu)

Jure Leskovec (jure@cs.stanford.edu)

Class meetings:

Mondays and Wednesdays 4:30PM - 5:50PM in Rm. 380-380Y.

This is a project course. There will be only a few weekly lectures and only one or two introductory homework assignments. We will spend the quarter working in teams on different large-scale data mining projects. Each team will meet individually with its assigned mentor.

Teaching assistant:

Apaar Sadhwani (apaars@stanford.edu)
Office hours: by request (email the teaching assistant).


You can reach the course staff at cs341-spr1516-staff@lists.stanford.edu
Use Piazza to post class-related questions: https://piazza.com/stanford/spring2016/cs341

Previous versions of the course:

CS341: Spring 2013

CS341: Spring 2012

CS341: Spring 2011


The course is generously supported by Amazon, which gives us access to its EC2 platform.