Compressing unstructured mesh data from simulations using machine learning

Kamath, Chandrika

doi:10.1007/s41060-019-00180-6

Journal Article · April 1, 2019 · International Journal of Data Science and Analytics

DOI: https://doi.org/10.1007/s41060-019-00180-6 · OSTI ID: 1738887

Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

The amount of data output by computer simulations has grown to terabytes and petabytes as increasingly complex simulations are run on massively parallel systems. As we approach exaflop computing in the next decade, the I/O subsystem is not expected to be able to write out these large volumes of data. In this paper, we explore the use of machine learning to compress the data before it is written out. Although computational constraints limit us to very simple learning algorithms, our results show that machine learning is a viable option for compressing unstructured data. We also demonstrate that simply using a better sampling algorithm to generate the training set yields more accurate results than random sampling, at no extra cost. Finally, by carefully selecting and incorporating points with high prediction error, we can improve reconstruction accuracy without sacrificing the compression rate.
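The general idea in the abstract (keep a sample of mesh points, predict the rest with a simple learner, then spend part of the storage budget on points the learner predicts poorly) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the learner is an inverse-distance-weighted average of the k nearest retained points, the initial sample is plain random sampling, and the names `idw_predict`, `compress`, `reconstruct`, `budget`, and `n_refine` are hypothetical. The sketch also assumes the mesh coordinates are available when the data is read back.

```python
import math
import random


def idw_predict(kept_pts, kept_vals, query, k=3):
    """Predict a value at `query` from the k nearest retained points
    using inverse-distance weighting (an assumed, very simple learner)."""
    nearest = sorted(
        (math.dist(p, query), v) for p, v in zip(kept_pts, kept_vals)
    )[:k]
    if nearest[0][0] == 0.0:  # query coincides with a retained point
        return nearest[0][1]
    weights = [1.0 / d for d, _ in nearest]
    return sum(w * v for w, (_, v) in zip(weights, nearest)) / sum(weights)


def compress(points, values, budget, n_refine, seed=0):
    """Return the indices of the points to store.

    A random sample of size budget - n_refine is drawn first; the n_refine
    points with the largest prediction error are then added, so the total
    number of stored points (and hence the compression rate) stays fixed.
    """
    rng = random.Random(seed)
    kept = set(rng.sample(range(len(points)), budget - n_refine))
    kept_pts = [points[i] for i in kept]
    kept_vals = [values[i] for i in kept]
    errors = [
        (abs(idw_predict(kept_pts, kept_vals, points[i]) - values[i]), i)
        for i in range(len(points))
        if i not in kept
    ]
    errors.sort(reverse=True)  # largest prediction error first
    kept.update(i for _, i in errors[:n_refine])
    return sorted(kept)


def reconstruct(points, kept, kept_vals):
    """Rebuild the full field: stored values are returned exactly,
    all other points are predicted from the stored ones."""
    kept_pts = [points[i] for i in kept]
    stored = dict(zip(kept, kept_vals))
    return [
        stored[i] if i in stored else idw_predict(kept_pts, kept_vals, p)
        for i, p in enumerate(points)
    ]


# Synthetic 2D "mesh": 200 scattered points carrying a smooth field.
rng = random.Random(1)
points = [(rng.random(), rng.random()) for _ in range(200)]
values = [math.sin(4 * x) + math.cos(3 * y) for x, y in points]

kept = compress(points, values, budget=40, n_refine=10)
approx = reconstruct(points, kept, [values[i] for i in kept])
```

In this toy setup only 40 of the 200 values are stored. Swapping the random draw in `compress` for a more evenly spaced sampling scheme would be the natural place to experiment with the sampling effect the abstract describes.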

Research Organization: Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Sponsoring Organization: USDOE National Nuclear Security Administration (NNSA)

Grant/Contract Number: AC52-07NA27344

OSTI ID: 1738887

Report Number(s): LLNL-JRNL-750460; 935302

Journal Information: International Journal of Data Science and Analytics, Vol. 9, Issue 1; ISSN 2364-415X

Publisher: Springer

Country of Publication: United States

Language: English

Related Subjects

97 MATHEMATICS AND COMPUTING
Regression
compression
computer simulations
mesh data
