HDF5 Sustainable Strategies for Longevity�
1
Scientific Achievement
The HDF5 library offers connectors for the Virtual Object Layer (VOL), which is an abstract layer that enables custom operations for improved I/O performance, such as (1) The Asynchronous connector overlapping I/O and computation operations that would otherwise be performed sequentially, (2) the Cache connector using fast storage layers for caching and staging data, and (3) the Log-based connector using a time log layout. Connectors can significantly reduce the I/O latency, reducing the total execution time of scientific codes.
Significance and Impact
HDF5 is a versatile data model, library, and file format for storing and managing data. It is a crucial component for many scientific applications. Maintaining its performance features and guaranteeing efficient I/O performance on a large scale is essential to ensuring continued support for these applications.
Comparison of the observed data write time at different time steps of a Magnitude 7 earthquake simulation (EQSIM) run on 256 Frontier nodes. The asynchronous I/O capability can effectively hide the I/O latency by overlapping the I/O operations on the CPUs with the computation operations on the GPUs. The last three writes are at the end of the simulation when there is no computation to overlap with.
Technical Approach
M. Scot Breitenfeld, Houjun Tang, Huihuo Zheng, Jordan Henderson, Suren Byna, "HDF5 in the Exascale Era: Delivering Efficient and Scalable Parallel I/O for Exascale Applications", Under review in The International Journal of High Performance Computing Applications.
PI(s)/Facility Lead(s): Houjun Tang, Jean Luca Bez, Scot Breitenfeld
Collaborating Institutions: LBNL, The HDF Group
ASCR Program: RAPIDS2
ASCR PM: Kalyan Perumalla
44X speedup