Lec9. NAS (2)
EECE695D: Efficient ML Systems
Recap
Three building blocks of NAS
Elsken et al., “Neural Architecture Search: A Survey,” JMLR 2019
Recap
Elsken et al., “Neural Architecture Search: A Survey,” JMLR 2019
Search by Reinforcement Learning
Train a policy that generates a set of HPs.
Example. Zoph and Le (2017) uses an RNN controller to generate HPs.
Zoph and Le, “NAS with RL,” ICLR 2017
Search by Reinforcement Learning
To update RNN, evaluate the policy gradients w.r.t. the REINFORCE loss.
where R is the validation performance of the model configured by .
Zoph and Le, “NAS with RL,” ICLR 2017
Search by Reinforcement Learning
Update the RNN controller parameters using the gradient:
Williams, “Simple statistical gradient-following algorithms for connectionist RL” Machine Learning, 1992
Search by Reinforcement Learning
Example. ProxylessNAS (Cai et al., 2019).
Cai et al., “ProxylessNAS: Direct NAS on target task and hardware,” ICLR 2019
Search by Evolutionary Methods
Evolutionary methods work as follows.
Search by Evolutionary Methods
Example. Amoebanet (Real et al., 2019)
Utilizes the tournament selection.
Real et al., “Regularized Evolution for Image Classifier Architecture Search” AAAI 2019
Search by Evolutionary Methods
Real et al., “Regularized Evolution for Image Classifier Architecture Search” AAAI 2019
Progressive Search
Progressively expand-and-search the search space.
Example. Progressive NAS (Liu et al., 2018)
(repeat…)
Liu et al., “Progressive NAS” ECCV 2018
Progressive Search
Liu et al., “Progressive NAS” ECCV 2018
Performance Evaluation Strategy
Performance Evaluation
Problem. Full training is costly.
Solution. Use some proxy task.
Performance Evaluation: Shorter Training
Problem. Simply selecting the best solution may not be good enough…
Zela et al., “Towards automated deep learning: Efficient joint neural architecture and hyperparameter search,” ICML workshop 2018
Performance Evaluation: Loss Prediction
Solution. Train a predictor for loss.
Example. Baker et al. (2018) observes that models tend to have� similar loss curves.
Baker et al., “Accelerating NAS using performance prediction,” ICLR 2018
Performance Evaluation: Loss Prediction
Baker et al. (2018) uses nu-SVR to predict the curve,�from the early 25% of the curve.
Baker et al., “Accelerating NAS using performance prediction,” ICLR 2018
Performance Evaluation: Weight Inheritance
Idea. Maybe don’t train from scratch…?
Related Work. Chen et al. (2016) proposed Net2Net,� which transfers weights to other tasks for adaptation.
Chen et al., “Net2Net: Accelerating learning via knowledge transfer,” ICLR 2016
Performance Evaluation: Weight Inheritance
Chen et al., “Net2Net: Accelerating learning via knowledge transfer,” ICLR 2016
Performance Evaluation: Weight Inheritance
Use this idea for NAS!
Example. EfficientNAS (Pham et al., 2018) views NAS as� Finding a Subgraph of a big, universal network.
Updates weights with GD, and uses RNN to find subgraph.
Pham et al., “Efficient NAS via parameter sharing,” arXiv 2018
Performance Evaluation: Weight Inheritance
Example. DARTS (Liu et al., 2018) uses GD for finding the subgraph.
Liu et al., “DARTS: Differentiable architecture search,” ICLR 2019
Liu et al., “DARTS: Differentiable architecture search,” ICLR 2019
Liu et al., “DARTS: Differentiable architecture search,” ICLR 2019
Liu et al., “DARTS: Differentiable architecture search,” ICLR 2019
Zero-Shot NAS
Question. Can we evaluate model quality without training?
Motivation. NAS is similar to pruning,� and it seems like we can evaluate weight quality without training.
Zero-Shot NAS
Mellor et al. (2021) shows that this is possible.
Critical decision. Which proxy score?
Mellor et al., “NAS without training” ICML 2021
Mellor et al., “NAS without training” ICML 2021
Zero-Cost Proxy
NASWOT uses the Jacobian Covariance.
Sad News
The Jacobian Covariance is not the ultimate winner!
White et al., “A deeper look at zero-cost proxies for lightweight NAS” ICLR Blog Track 2022
White et al., “A deeper look at zero-cost proxies for lightweight NAS” ICLR Blog Track 2022
White et al., “A deeper look at zero-cost proxies for lightweight NAS” ICLR Blog Track 2022
White et al., “A deeper look at zero-cost proxies for lightweight NAS” ICLR Blog Track 2022
A Better News
Abdelfattah et al., “Zero-cost proxies for lightweight NAS” ICLR 2021
Further Reading
Efficiency-aware NAS
Others