10-405/10-605
Machine Learning with Large Datasets
Homework Setups
Homework Overview
Registration
*Make sure to choose the community edition*
Login
Still Log in to Community Edition:
Import Lab Files
Installing Third-party Packages
“nose”
Creating a Cluster
Choose the default Spark version;
Use Python 3
Notes about Clusters
Interact with Notebooks