Mining Massive Data Sets: Hadoop Labs
Build a solid framework for data mining by taking advantage of this lab course. CS246H builds on the MapReduce framework Hadoop introduced. Hadoop will be covered in depth to give students a more complete understanding of the platform and its role in data mining.
This is a partner course to CS246: Mining Massive Datasets
and does not include additional assignments.
Important course information will be posted on this web page and announced in class. You are responsible for all material that appears here and should check this page for updates frequently.
- 1/7: The first class will be held at 12:50 on Wednesday 1/8, in NVidia Auditorium, Jen-Hsun Huang Engineering Center.
We look forward to seeing you there!
Wednesdays 12:50-2:05pm in NVidia Auditorium, Jen-Hsun Huang Engineering Center.
Watch video lectures on SCPD (any Stanford student can see them here).
Daniel Templeton, Cloudera
Office Hours: Wednesdays 9-10am, Gates 418
Office Hours: Wednesdays 9-10am, Gates InfoLab Lab
You Will Learn to
- Implement data mining algorithms discussed in CS246 using Hadoop
- Implement and debug complex MapReduce jobs in Hadoop
- Use some of the tools in the Hadoop ecosystem for data mining and machine learning
- Cloudera ML/Oryx
You can reach us at firstname.lastname@example.org
Use Piazza to post class related questions: https://piazza.com/stanford/winter2014/cs246h/home