1 of 21

Scalable Hyper-parameter Optimization using RAPIDS and AWS

AKSHIT ARORA

@_AkshitArora

SRISHTI YADAV

@_srishtiyadav

2 of 21

SPEAKER: AKSHIT ARORA

  • Deep learning solutions architect at NVIDIA
  • Optimize and deploy deep learning and machine learning pipelines in production
  • Previously, built deep learning models for tasks in domains such as Education, Virtual Reality, Natural Language Processing and Weather Prediction

🔗 aroraakshit.github.io

(pronounced \Uhk-Shith Aurora\)


3 of 21

SPEAKER: SRISHTI YADAV

  • Deep learning researcher at Simon Fraser University, Canada
  • Research, create and optimize deep learning and machine learning algorithms
  • Also building deep learning algorithms for sensors on the International Space Station (Intern: Urthecast)

🔗 srishti.dev


4 of 21

WHY THIS TALK?


5 of 21

WE ALL LOVE TO SPEED UP THINGS

BUT ARE WE ๐Ÿƒโ€โ™€๏ธ๐Ÿƒโ€โ™‚๏ธ THINGS FASTER ?


6 of 21

COMPANIES LOVE TO STORE AND USE DATA

BUT ARE THEY REALLY CASHING IN ON IT?


7 of 21

WE KNOW CLOUDS ARE A THING

BUT WE DON'T KNOW IF IT'S WITHIN REACH 🤣


8 of 21

HOW ARE WE HELPING?


9 of 21

โšก๐Ÿƒโ€โ™€๏ธ๐Ÿƒโ€โ™‚๏ธ : Papermill

๐Ÿ’ฐ๐Ÿ’ฐ๐Ÿ’ฐ : GPU

โ˜๏ธ๐Ÿ–ฅ๏ธโ˜๏ธ ๐Ÿ–ฅ๏ธ : AWS


10 of 21

IT CAN BE AN OCEAN OF DETAILS

BUT BEFORE WE DIVE IN


11 of 21

Hyper-Parameter Optimization (HPO)

For 5 parameters, each with 4 desired values, we will have 4⁵ = 1,024 possible combinations

  • Combinations can increase exponentially, and your training time with them 😯
  • Manually trying each of them 🤮
  • Solution: Automating Hyper-parameter optimization (HPO) 🎉
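To see the blow-up concretely, here is a minimal sketch (the parameter names and values are hypothetical examples) of enumerating a 5-parameter, 4-value grid:

```python
from itertools import product

# Hypothetical example grid: 5 hyper-parameters, 4 candidate values each.
grid = {
    "learning_rate": [0.001, 0.01, 0.1, 0.3],
    "max_depth": [4, 6, 8, 10],
    "n_estimators": [50, 100, 200, 400],
    "subsample": [0.5, 0.7, 0.9, 1.0],
    "min_child_weight": [1, 2, 4, 8],
}

# One dict per combination: every choice of one value per parameter.
combos = [dict(zip(grid, values)) for values in product(*grid.values())]
print(len(combos))  # 4**5 = 1024
```

Adding a sixth parameter with 4 values multiplies this by 4 again, which is why automated HPO matters.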


12 of 21

WHAT ARE WE GOING TO DO?


13 of 21

CPU Based Scalability

Learn to parameterize notebooks from a set of parameters

You don't want to do all of them manually! 🤦

Automate parallel computation of parameterized notebooks.

Since we use CPU here, you can play with it at home too, without any cost.
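A CPU-only sketch of this fan-out, using a stand-in `train()` function where the real pipeline would execute a parameterized notebook per worker (function and parameter names are hypothetical):

```python
from concurrent.futures import ThreadPoolExecutor

def train(params):
    # Stand-in for one training run; the talk's pipeline would execute a
    # parameterized notebook here instead. Returns a fake "score".
    return params["max_depth"] * 0.1 + params["learning_rate"]

# Hypothetical parameter sets to evaluate.
param_sets = [
    {"learning_rate": lr, "max_depth": d}
    for lr in (0.01, 0.1)
    for d in (4, 8)
]

# Run all configurations concurrently and keep the best score.
with ThreadPoolExecutor(max_workers=4) as pool:
    scores = list(pool.map(train, param_sets))

best_score, best_params = max(zip(scores, param_sets))
print(best_params, best_score)
```

Threads suit this sketch because launching a notebook run is I/O-bound work; the loop body is the only part you would swap for a real executor call.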


14 of 21

Papermill lets you:

  • Parameterize notebooks
  • Execute notebooks

Helpful for running HPO tasks where the same model needs to be trained again and again with different sets of parameters.
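A sketch of the Papermill CLI invocations such a loop generates, one run per parameter set; `-p name value` passes one parameter into the notebook's cell tagged `parameters` (the notebook and output paths here are hypothetical):

```python
# Hypothetical parameter sets, one Papermill run each.
param_sets = [
    {"learning_rate": lr, "max_depth": d}
    for lr in (0.01, 0.1)
    for d in (4, 8)
]

commands = []
for i, params in enumerate(param_sets):
    # Papermill CLI shape: papermill <input.ipynb> <output.ipynb> -p <name> <value> ...
    flags = " ".join(f"-p {k} {v}" for k, v in params.items())
    commands.append(f"papermill train.ipynb runs/out_{i}.ipynb {flags}")

print(commands[0])
# papermill train.ipynb runs/out_0.ipynb -p learning_rate 0.01 -p max_depth 4
```

Each output notebook keeps its injected parameters and results, so every HPO trial stays inspectable after the fact.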


15 of 21

DEMO TIME


16 of 21

GPU Based Scalability

Learn to speed up the computation using GPU

Make efficient use of GPU using RAPIDS

Scale up the computation and have better metric visualization using AWS

Learn to do hyper-parameter optimization to find the best parameters from a given set

You don't want to iterate over all of them manually, trust us!


17 of 21

  • Amazon SageMaker is a fully managed machine learning service
  • Allows users to build, train, analyze, and deploy models in a production-ready environment


18 of 21

  • A collection of open-source libraries for end-to-end GPU Accelerated Data Science.
  • Involves little to no code change
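A sketch of the "little to no code change" claim: the snippet below uses pandas so it runs on any machine; on a GPU box the same groupby/mean code runs under RAPIDS cuDF by swapping the import (the example data is hypothetical):

```python
import pandas as pd  # on a GPU machine: `import cudf` exposes the same API

# Hypothetical HPO results: one score per tried max_depth value.
df = pd.DataFrame({"depth": [4, 4, 8, 8],
                   "score": [0.71, 0.69, 0.83, 0.85]})

# Identical groupby/mean call in pandas and cuDF.
summary = df.groupby("depth").score.mean()
print(summary.to_dict())  # mean score per depth
```

Because cuDF mirrors the pandas API, the speed-up comes from where the dataframe lives (GPU memory), not from rewriting the analysis code.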



20 of 21

DEMO TIME


21 of 21

Thank you!

Please find our code, slides, and other resources on our GitHub page:

https://github.com/copperwiring/scalable-hpo-pybay
