ytopt: A ML-based Autotuning for Energy Efficiency�With the NP Partnership (Femtoscale)
Scientific Achievement
As the complexity of HPC ecosystems continues to rise, achieving optimal performance and energy becomes a challenge. Large system and application parameter space requires few shots for automatic exploration of the space. ytopt is our autotuning effort to tune scientific applications for energy efficiency.
Significance and Impact
Our low-overhead ytopt framework can be used to identify the best combination of application and system parameters to result in the best performance. This helps not only improve application performance portability on different architectures but also explore performance tradeoffs for efficiently utilizing underlying HPC systems.
This table presents the energy improvement percentage for using ytopt to autotune four ECP proxy apps in energy and EDP (Energy Delay Product) as the performance metrics on up to 4096 nodes on ALCF Theta.
Technical Approach
X. Wu, M. Kruse, P. Balaprakash, H. Finkel, P. Hovland, V. Taylor, and M. Hall, "Autotuning PolyBench benchmarks with LLVM Clang/Polly loop optimization pragmas using Bayesian optimization" Concurrency and Computation. Practice and Experience, Volume 34, Issue 20, 2022. ISSN 1532-0626 DOI: 10.1002/cpe.6683
PI(s)/Facility Lead(s): Robert Ross; Local Lab POC: Xingfu Wu, ANL
Collaborating Institutions: ANL, LBNL, ORNL,
ASCR Program: [SciDAC RAPIDS2 and OASIS, ECP]
ASCR PM: Lali Chatterji, Kalyan Perumalla
Publication(s) for this work: X. Wu, et al., “ytopt: Autotuning Scientific Applications for Energy Efficiency at Large Scales”, Proceeding of Cray User Group Conference 2023 (CUG’23) doi:10.48550/arXiv.2303.16245.
and/or Code Developed or Datasets: https://github.com/ytopt-team/ytopt
Metric | XSBench | SWFFT | AMG | SW4lite |
Energy | 8.6% | 2.1% | 20.9% | 21.2% |
EDP | 37.8% | 5.2% | 24.1% | 23.7% |
This diagram shows the ytopt framework using Bayesian Optimization for the search and Random Forests for the surrogate model to identify the best combination of application and system parameters for energy efficiency