ML-based Performance Portability for Density Functional Theory in HPC Environments (DECODE SciDAC-5)
Adrian P. Dieguez∗, Min Choi, Xinran Zhu, Bryan M. Wong and Khaled Z Ibrahim “ML-based Performance Portability for Time-Dependent Density Functional Theory in HPC Environments" 2022 International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), Supercomputing 2022.
3D FFT mapping in rt-TDDFT
The internal process for searching the optimal space using Bayesian Optimization. Transfer learning leverages configuration on one platform to find optimal configuration on another.
ASCR-BES DECODE SciDAC-5
The RT-DFT Tuning stack for optimizing the performance on HPC machines.
Scientific Achievement
Developed a framework for optimizing the search of runtime configurations and code variants to improve the performance portability on HPC platforms for RT-TDDFT codes.
Significance and Impact
Improving the search speed for optimal performance configuration is critical for performance portability for applications with large space of algorithmic variants and runtime configurations. The developed methodology, despite focusing on RT-TDDFT computation, can be used with a wide set of applications.
Research Details
Optimized search of the optimization space can leverage a wide set of techniques including Bayesian optimization and model-based estimation.
Developed a framework to do transfer-learning, where the search on an HPC platform is used to reduce the number of search attempts on another platform.
Focused on finding optimized configurations for the calculation involved in the conjugate gradient calculation part of rt-TDDFT.
Improved the performance by up to 9x, reducing the search attempts by up to 86%.