Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Botelho, Sergio; Joshi, Ameya; Khara, Biswajit; Rao, Vinay; Sarkar, Soumik; Hegde, Chinmay; Adavani, Santi; Ganapathysubramanian, Baskar

doi:10.1109/mlhpcai4s51975.2020.00013

Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Conference · Sun Nov 01 00:00:00 EDT 2020 · Workshop on Machine Learning in HPC Environments (Online)

DOI:https://doi.org/10.1109/mlhpcai4s51975.2020.00013· OSTI ID:1648524

Botelho, Sergio ^[1]; Joshi, Ameya; Khara, Biswajit; Rao, Vinay; Sarkar, Soumik; Hegde, Chinmay; Adavani, Santi; Ganapathysubramanian, Baskar

Iowa State University

Recent progress in scientific machine learning (SciML) has opened up the possibility of training novel neural network architectures that solve complex partial differential equations (PDEs). Several (nearly data free) approaches have been recently reported that successfully solve PDEs, with examples including deep feed forward networks, generative networks, and deep encoder-decoder networks. However, practical adoption of these approaches is limited by the difficulty in training these models, especially to make predictions at large output resolutions (≥1024×1024). Here we report on a software framework for data parallel distributed deep learning that resolves the twin challenges of training these large SciML models - training in reasonable time as well as distributing the storage requirements. Our framework provides several out of the box functionality including (a) loss integrity independent of number of processes, (b) synchronized batch normalization, and (c) distributed higher-order optimization methods. We show excellent scalability of this framework on both cloud as well as HPC clusters, and report on the interplay between bandwidth, network topology and bare metal vs cloud. We deploy this approach to train generative models of sizes hitherto not possible, showing that neural PDE solvers can be viably trained for practical applications. We also demonstrate that distributed higher-order optimization methods are 2-3× faster than stochastic gradient-based methods and provide minimal convergence drift with higher batch-size.

Research Organization:: Iowa State University

Sponsoring Organization:: USDOE Advanced Research Projects Agency - Energy (ARPA-E)

Contributing Organization:: RocketML Inc.

DOE Contract Number:: AR0001215

OSTI ID:: 1648524

Report Number(s):: arXiv:2007.12792

Journal Information:: Workshop on Machine Learning in HPC Environments (Online), Journal Name: Workshop on Machine Learning in HPC Environments (Online) Vol. 2020; ISSN 2768-4253

Publisher:: IEEE

Country of Publication:: United States

Language:: English

References (12)

Learning Particle Physics by Example: Location-Aware Generative Adversarial Networks for Physics Synthesis de Oliveira, Luke; Paganini, Michela; Nachman, Benjamin Computing and Software for Big Science, Vol. 1, Issue 1 https://doi.org/10.1007/s41781-017-0004-6	journal	September 2017
Simulator-free solution of high-dimensional stochastic elliptic partial differential equations using deep neural networks Karumuri, Sharmila; Tripathy, Rohit; Bilionis, Ilias Journal of Computational Physics, Vol. 404 https://doi.org/10.1016/j.jcp.2019.109120	journal	March 2020
On the control of solidification using magnetic fields and magnetic field gradients Ganapathysubramanian, Baskar; Zabaras, Nicholas International Journal of Heat and Mass Transfer, Vol. 48, Issue 19-20 https://doi.org/10.1016/j.ijheatmasstransfer.2005.04.027	journal	September 2005
Inverse molecular design using machine learning: Generative models for matter engineering Sanchez-Lengeling, Benjamin; Aspuru-Guzik, Alán Science, Vol. 361, Issue 6400 https://doi.org/10.1126/science.aat2663	journal	July 2018
DGM: A deep learning algorithm for solving partial differential equations Sirignano, Justin; Spiliopoulos, Konstantinos Journal of Computational Physics, Vol. 375 https://doi.org/10.1016/j.jcp.2018.08.029	journal	December 2018
fPINNs: Fractional Physics-Informed Neural Networks Pang, Guofei; Lu, Lu; Karniadakis, George Em SIAM Journal on Scientific Computing, Vol. 41, Issue 4 https://doi.org/10.1137/18M1229845	journal	January 2019
Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data Zhu, Yinhao; Zabaras, Nicholas; Koutsourelakis, Phaedon-Stelios Journal of Computational Physics, Vol. 394 https://doi.org/10.1016/j.jcp.2019.05.024	journal	October 2019
Solving high-dimensional partial differential equations using deep learning Han, Jiequn; Jentzen, Arnulf; E., Weinan Proceedings of the National Academy of Sciences, Vol. 115, Issue 34 https://doi.org/10.1073/pnas.1718942115	journal	August 2018
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations Raissi, M.; Perdikaris, P.; Karniadakis, G. E. Journal of Computational Physics, Vol. 378 https://doi.org/10.1016/j.jcp.2018.10.045	journal	February 2019
How do evaporating thin films evolve? Unravelling phase-separation mechanisms during solvent-based fabrication of polymer blends Wodo, Olga; Ganapathysubramanian, Baskar Applied Physics Letters, Vol. 105, Issue 15 https://doi.org/10.1063/1.4898136	journal	October 2014
CaloGAN: Simulating 3D high energy particle showers in multilayer electromagnetic calorimeters with generative adversarial networks Paganini, Michela; de Oliveira, Luke; Nachman, Benjamin Physical Review D, Vol. 97, Issue 1 https://doi.org/10.1103/PhysRevD.97.014021	journal	January 2018
Hidden physics models: Machine learning of nonlinear partial differential equations Raissi, Maziar; Karniadakis, George Em Journal of Computational Physics, Vol. 357 https://doi.org/10.1016/j.jcp.2017.11.039	journal	March 2018

Similar Records

Machine learning models for PDE constrained optimization

Technical Report · Mon Sep 01 00:00:00 EDT 2025 · OSTI ID:2589582

Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems

Journal Article · Fri Mar 18 00:00:00 EDT 2022 · Computer Methods in Applied Mechanics and Engineering · OSTI ID:1976976

DPM: A deep learning PDE augmentation method with application to large-eddy simulation

Journal Article · Thu Sep 03 00:00:00 EDT 2020 · Journal of Computational Physics · OSTI ID:1850305

Related Subjects

PDEs
Cloud vs HPC
Deep generative models
Distributed training
Higher-order optimization
Loss functions

Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Citation Formats

References (12)

Similar Records

Related Subjects