10-605 / 10-805
Machine Learning from Large Datasets
Intro 8/25
Machine Learning from Large Datasets
Overview of Intro Lecture
Overview of Intro Lecture
Main idea: better model joint distribution of model size (# parameters), model performance (test set loss / perplexity), and training cost (floating point operations, aka FLOPS)
Idea: fit a simple model to the data that relates compute, parameters, and loss, and use that to optimize loss given a total compute budget.
Compute Scaling Laws = Data Scaling Laws (title text)
Overview of Intro Lecture
7B params: 200B 🡪 1T toks
1B toks: 7B🡪33B params
Map-Reduce and Spark 8/27
Machine Learning from Large Datasets
Overview of Map-Reduce/Spark lecture
Spark Workflows 9/3
Machine Learning from Large Datasets
Overview of Workflow lecture
Learning as Optimization �9/8, 10, 15
Machine Learning from Large Datasets
Overview of Optimization 1
Overview of Optimization 2
Overview of Optimization 2 + 3
Randomized Algorithms�9/17 and 9/22 �
Machine Learning from Large Datasets
Overview of Randomized Algorithms 1
Overview of Randomized Algorithms 2
Autodiff�9/24
Machine Learning from Large Datasets
Overview of Randomized Algorithms – extended to 2/12
GPUS�9/29, 10/1
Machine Learning from Large Datasets
Overview of GPU-based ML�
Overview of GPU-based ML�
Overview of GPU-based ML�
Overview of GPU-based ML�
FINAL POINTS
Writing Session ≠ Exam
Writing Session ≠ Exam