Project in Mining Massive Data Sets
Spring 2015
CS341 is an advanced project-based course. Students will work on data mining and machine learning algorithms for analyzing very large amounts of data. The course staff will provide both interesting big datasets and computational infrastructure (a large MapReduce cluster).


Course information:


Jure Leskovec (jure@cs.stanford.edu)

Anand Rajaraman (datawocky@gmail.com)

Chris Re (chrismre@cs.stanford.edu)

Jeff Ullman (ullman@gmail.com)

Rok Sosic (rok@cs.stanford.edu)

Class meetings:

Mondays and Wednesdays 4:15PM - 5:30PM in Rm. 380-380F.

This is a project course. There will be only a few weekly lectures and only one or two introductory homework assignments. We will spend the quarter working in teams on different large-scale data mining projects. Each team will meet individually with its assigned mentor.

Teaching assistant:

Pararth Shah (pararth@stanford.edu)
Office Hours: upon request (send me an email).


You can reach us at cs341-spr1415-staff@lists.stanford.edu
Use Piazza to post class-related questions: http://piazza.com/class#spring2015/cs341
(Piazza usually requires an @stanford.edu email address to register. If you do not have an @stanford.edu address, use the access code "cs341" to subscribe to the class.)

Previous versions of the course:

CS341: Spring 2013

CS341: Spring 2012

CS341: Spring 2011


The course is generously supported by Amazon, which gives us access to its EC2 platform.