1 of 1

Region-adaptive, Error-controlled Scientific Data Compression for Scientific Data

1

Significance and Achievement

Built an adaptive data compression pipeline upon the multilevel compressor MGARD with automatic critical region detection and spatially varying error bounds and demonstrated that data generated by E3SM can be greatly reduced while the quantities of interest (QoI) by post-analysis are preserved.

Technical Approach

  • Developed a region-adaptive compression method which can detect critical regions and compress data using spatially varying error bounds
  • Derived mathematical equations and compression errors bounds for region-adaptive accuracy control
  • Evaluated on two climate use cases showing improved data compression ratios and lower errors in post-analysis compared to single-error-bounded approaches

Significance and Impact

Comparing to existing single-error-bounded lossy compressors, we derive theories of point-wise error preservation and develop a pipeline capable of delivering larger compression ratios and preserving the accuracy of post-processing by applying spatially changed error bound for data inside and outside regions of interest (RoI)

"Region-adaptive, Error-controlled Scientific Data Compression using Multilevel Decomposition."  Gong, Qian, et al. SSDBM 2022.

Work was performed at Oak Ridge National Laboratory

Fig 1. Region-adaptive compression pipeline

Fig 2. TC detection: adaptive vs uniform compression

Fig 2. AR detection: adaptive vs uniform compression