Project in Mining Massive Data Sets
CS341 is an advanced project-based course. Students will work on data
mining and machine learning algorithms for analyzing very large amounts
of data. Both interesting big datasets and computational
infrastructure (a large MapReduce cluster) will be provided by the course staff.
6/07: As announced earlier, the final reports are due 11:59PM on Wednesday, June 8. The reports should be emailed as PDF or docx to email@example.com
Relevant Piazza post: 1.
6/01: Final presentations are 4-7:15pm June 2 (Thursday) in Gates 415. Dinner will be served. All teams should send their presentations to Apaar (firstname.lastname@example.org) by 11:59PM Wednesday, June 1.
Schedule for the presentations. Relevant Piazza posts: 1.
4/28: Midterm presentations are May 2 (Monday) and May 4 (Wednesday). All teams should send their presentations to Apaar (email@example.com) by 11:59PM Sunday, May 1.
Schedule for the presentations. Relevant Piazza posts: 1, 2.
4/13: The dates for the midterm presentation, final presentation, and report have been updated.
Link to the schedule.
4/06: Tutorial: EC2, data management and clusters on AWS (by Apaar).
Link to the Google Doc.
3/31: Please sign up on Piazza!
Link to the Piazza page.
3/30: Tutorial: Getting Started with Amazon Web Services (by Apaar).
Link to the Google Doc.
Before Spring quarter: Slides and contact information for the projects proposed at the March 3, 2016 Infosession can be found
here. Project proposals are due March 18, 2016. Mail them to cs341-spr1516-staff at lists dot stanford dot edu.
For more information see Introductory Slides.
Chris Re (firstname.lastname@example.org)
Anand Rajaraman (email@example.com)
Jeff Ullman (firstname.lastname@example.org)
Andreas Paepcke (paepcke@cs.Stanford.edu)
Rok Sosic (email@example.com)
Jure Leskovec (firstname.lastname@example.org)
Mondays and Wednesdays 4:30PM - 5:50PM in Rm. 380-380Y.
This is a project course. There will be only a few weekly lectures
and only one or two introductory homeworks. We will spend the quarter
working in teams on different large-scale data mining projects.
Each team will meet individually with its assigned mentor.
Apaar Sadhwani (email@example.com)
Office Hours: upon request (send me an email).
You can reach us at firstname.lastname@example.org
Use Piazza to post class-related questions: https://piazza.com/stanford/spring2016/cs341
Previous versions of the course:
CS341: Spring 2013
CS341: Spring 2012
CS341: Spring 2011
The course is generously supported by Amazon, which gives us access to its EC2 platform.