DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Deep learning to estimate permeability using geophysical data

Abstract

Time-lapse electrical resistivity tomography (ERT) is a popular geophysical method to estimate three-dimensional (3D) permeability fields from electrical potential difference measurements. Traditional inversion and data assimilation methods are used to ingest this ERT data into hydrogeophysical models to estimate permeability. Due to ill-posedness and the curse of dimensionality, existing inversion strategies provide poor estimates and low resolution of the 3D permeability field. Recent advances in deep learning provide us with powerful algorithms to overcome this challenge. This paper presents a deep learning (DL) framework to estimate the 3D subsurface permeability from time-lapse ERT data. To test the feasibility of the proposed framework, we train DL-enabled inverse models on simulation data. Each measurement in both synthetic and field data is standardized by removing the mean and scaling the time-series to unit variance. This pre-processing step is necessary to bring simulation data closer to field observations. Subsurface process models based on hydrogeophysics are used to generate this synthetic data. Training performed on limited simulation data resulted in the DL model over-fitting. An advanced data augmentation based on mixup is implemented to generate additional training samples to overcome this issue. This mixup technique creates weakly labeled (low-fidelity) samples from strongly labeled (high-fidelity) data.more » The weakly labeled training data is then used to develop DL-enabled inverse models and reduce over-fitting. As both time-lapse ERT (1133048 features/realization) and 3D permeability (585453 features/realization) data samples are from a high-dimensional space, principal component analysis (PCA) is employed to reduce dimensionality. Encoded ERT and encoded permeability are generated using the trained PCA estimators. A deep neural network is then trained to map the encoded ERT to encoded permeability. This mixup training and unsupervised learning allowed us to build a fast and reasonably accurate DL-based inverse model under limited simulation data. Results show that proposed weak supervised learning can capture salient spatial features in the 3D permeability field. Quantitatively, the average mean squared error (in terms of the natural log) on the strongly labeled training, validation, and test datasets is less than 0.5. The R2-score (global metric) is greater than 0.75, and the percent error in each cell (local metric) is less than 10%. Finally, an added benefit in terms of computational cost is that the proposed DL-based inverse model is at least O(104) times faster than running a forward model once it is trained. Data generation, DL model training, and hyperparameter tuning to identify optimal neural network architectures utilized high-performance computing resources while the DL inference is performed on a standard laptop. Approximately, O(105) processor hours are used for generating data and DL tuning and training. We acknowledge that the data generation and DL model development are expensive. But once a DL model is trained, it can be re-used for inversion rapidly for the given system, with set physics and domain. Note that traditional inversion may require multiple forward model simulations (e.g., in the order of 10 to 1000), which are very expensive. This computational savings ≈ O(105) – O(107)) makes the proposed DL-based inverse model attractive for subsurface imaging and real-time ERT monitoring applications due to fast and yet reasonably accurate estimations of permeability field.« less

Authors:
ORCiD logo [1]; ORCiD logo [1];  [2];  [1]
  1. Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
  2. Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)
Publication Date:
Research Org.:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Earth and Environmental Systems Science Division; USDOE
OSTI Identifier:
1883390
Alternate Identifier(s):
OSTI ID: 1960964
Report Number(s):
PNNL-SA-175440
Journal ID: ISSN 0309-1708
Grant/Contract Number:  
AC05-76RL01830; AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Advances in Water Resources
Additional Journal Information:
Journal Volume: 167; Journal ID: ISSN 0309-1708
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
Subject:
58 GEOSCIENCES; Artificial neural networks; Ensemble methods; Deep learning; Hydrogeophysics; Electrical resistivity tomography; Permeability; Multi-physics; Dimensionality reduction; Deep neural networks; Scalable deep learning

Citation Formats

Mudunuru, Maruti K., Cromwell, Erol L. D., Wang, Hongsheng, and Chen, Xingyuan. Deep learning to estimate permeability using geophysical data. United States: N. p., 2022. Web. doi:10.1016/j.advwatres.2022.104272.
Mudunuru, Maruti K., Cromwell, Erol L. D., Wang, Hongsheng, & Chen, Xingyuan. Deep learning to estimate permeability using geophysical data. United States. https://doi.org/10.1016/j.advwatres.2022.104272
Mudunuru, Maruti K., Cromwell, Erol L. D., Wang, Hongsheng, and Chen, Xingyuan. Fri . "Deep learning to estimate permeability using geophysical data". United States. https://doi.org/10.1016/j.advwatres.2022.104272. https://www.osti.gov/servlets/purl/1883390.
@article{osti_1883390,
title = {Deep learning to estimate permeability using geophysical data},
author = {Mudunuru, Maruti K. and Cromwell, Erol L. D. and Wang, Hongsheng and Chen, Xingyuan},
abstractNote = {Time-lapse electrical resistivity tomography (ERT) is a popular geophysical method to estimate three-dimensional (3D) permeability fields from electrical potential difference measurements. Traditional inversion and data assimilation methods are used to ingest this ERT data into hydrogeophysical models to estimate permeability. Due to ill-posedness and the curse of dimensionality, existing inversion strategies provide poor estimates and low resolution of the 3D permeability field. Recent advances in deep learning provide us with powerful algorithms to overcome this challenge. This paper presents a deep learning (DL) framework to estimate the 3D subsurface permeability from time-lapse ERT data. To test the feasibility of the proposed framework, we train DL-enabled inverse models on simulation data. Each measurement in both synthetic and field data is standardized by removing the mean and scaling the time-series to unit variance. This pre-processing step is necessary to bring simulation data closer to field observations. Subsurface process models based on hydrogeophysics are used to generate this synthetic data. Training performed on limited simulation data resulted in the DL model over-fitting. An advanced data augmentation based on mixup is implemented to generate additional training samples to overcome this issue. This mixup technique creates weakly labeled (low-fidelity) samples from strongly labeled (high-fidelity) data. The weakly labeled training data is then used to develop DL-enabled inverse models and reduce over-fitting. As both time-lapse ERT (1133048 features/realization) and 3D permeability (585453 features/realization) data samples are from a high-dimensional space, principal component analysis (PCA) is employed to reduce dimensionality. Encoded ERT and encoded permeability are generated using the trained PCA estimators. A deep neural network is then trained to map the encoded ERT to encoded permeability. This mixup training and unsupervised learning allowed us to build a fast and reasonably accurate DL-based inverse model under limited simulation data. Results show that proposed weak supervised learning can capture salient spatial features in the 3D permeability field. Quantitatively, the average mean squared error (in terms of the natural log) on the strongly labeled training, validation, and test datasets is less than 0.5. The R2-score (global metric) is greater than 0.75, and the percent error in each cell (local metric) is less than 10%. Finally, an added benefit in terms of computational cost is that the proposed DL-based inverse model is at least O(104) times faster than running a forward model once it is trained. Data generation, DL model training, and hyperparameter tuning to identify optimal neural network architectures utilized high-performance computing resources while the DL inference is performed on a standard laptop. Approximately, O(105) processor hours are used for generating data and DL tuning and training. We acknowledge that the data generation and DL model development are expensive. But once a DL model is trained, it can be re-used for inversion rapidly for the given system, with set physics and domain. Note that traditional inversion may require multiple forward model simulations (e.g., in the order of 10 to 1000), which are very expensive. This computational savings ≈ O(105) – O(107)) makes the proposed DL-based inverse model attractive for subsurface imaging and real-time ERT monitoring applications due to fast and yet reasonably accurate estimations of permeability field.},
doi = {10.1016/j.advwatres.2022.104272},
journal = {Advances in Water Resources},
number = ,
volume = 167,
place = {United States},
year = {Fri Jul 15 00:00:00 EDT 2022},
month = {Fri Jul 15 00:00:00 EDT 2022}
}

Works referenced in this record:

Discovering State‐Parameter Mappings in Subsurface Models Using Generative Adversarial Networks
journal, October 2018


Neural Style Transfer: A Review
journal, November 2020

  • Jing, Yongcheng; Yang, Yezhou; Feng, Zunlei
  • IEEE Transactions on Visualization and Computer Graphics, Vol. 26, Issue 11
  • DOI: 10.1109/TVCG.2019.2921336

Advances in interpretation of subsurface processes with time-lapse electrical imaging
journal, August 2014

  • Singha, K.; Day-Lewis, F. D.; Johnson, T.
  • Hydrological Processes, Vol. 29, Issue 6, p. 1549-1576
  • DOI: 10.1002/hyp.10280

Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN: Evaluating the Parallel Performance of Pflotran
journal, January 2014

  • Hammond, G. E.; Lichtner, P. C.; Mills, R. T.
  • Water Resources Research, Vol. 50, Issue 1
  • DOI: 10.1002/2012WR013483

Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
journal, February 2019


Hydro-, bio-geophysics
journal, July 2004

  • al Hagrey, Said A.; Meissner, Rolf; Werban, Ulrike
  • The Leading Edge, Vol. 23, Issue 7
  • DOI: 10.1190/1.1776739

Biogeophysics: A new frontier in Earth science research
journal, January 2009

  • Atekwana, Estella A.; Slater, Lee D.
  • Reviews of Geophysics, Vol. 47, Issue 4
  • DOI: 10.1029/2009RG000285

Machine learning to discover mineral trapping signatures due to CO2 injection
journal, July 2021

  • Ahmmed, Bulbul; Karra, Satish; Vesselinov, Velimir V.
  • International Journal of Greenhouse Gas Control, Vol. 109
  • DOI: 10.1016/j.ijggc.2021.103382

Less is more: Sampling chemical space with active learning
journal, June 2018

  • Smith, Justin S.; Nebgen, Ben; Lubbers, Nicholas
  • The Journal of Chemical Physics, Vol. 148, Issue 24
  • DOI: 10.1063/1.5023802

Estimating Watershed Subsurface Permeability From Stream Discharge Data Using Deep Neural Networks
journal, February 2021


PFLOTRAN-SIP: A PFLOTRAN Module for Simulating Spectral-Induced Polarization of Electrical Impedance Data
journal, December 2020

  • Ahmmed, Bulbul; Mudunuru, Maruti Kumar; Karra, Satish
  • Energies, Vol. 13, Issue 24
  • DOI: 10.3390/en13246552

Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients
journal, July 2021


Trends, prospects and challenges in quantifying flow and transport through fractured rocks
journal, February 2005


Borehole and surface-based hydrogeophysics
journal, February 2005


A comparative simulation study of coupled THM processes and their effect on fractured rock permeability around nuclear waste repositories
journal, October 2008


Characterizing flow and transport in fractured geological media: A review
journal, August 2002


The Microbial Community Structure in Petroleum-Contaminated Sediments Corresponds to Geophysical Signatures
journal, May 2007

  • Allen, Jonathan P.; Atekwana, Estella A.; Atekwana, Eliot A.
  • Applied and Environmental Microbiology, Vol. 73, Issue 9
  • DOI: 10.1128/AEM.01752-06

DART-PFLOTRAN: An ensemble-based data assimilation system for estimating subsurface flow and transport model parameters
journal, August 2021


Multiscale geophysical imaging of the critical zone: Geophysical Imaging of the Critical Zone
journal, January 2015

  • Parsekian, A. D.; Singha, K.; Minsley, B. J.
  • Reviews of Geophysics, Vol. 53, Issue 1
  • DOI: 10.1002/2014RG000465

The emergence of hydrogeophysics for improved understanding of subsurface processes over multiple scales: The Emergence of Hydrogeophysics
journal, June 2015

  • Binley, Andrew; Hubbard, Susan S.; Huisman, Johan A.
  • Water Resources Research, Vol. 51, Issue 6
  • DOI: 10.1002/2015WR017016

Improved hydrogeophysical characterization and monitoring through parallel modeling and inversion of time-domain resistivity andinduced-polarization data
journal, July 2010

  • Johnson, Timothy C.; Versteeg, Roelof J.; Ward, Andy
  • Geophysics, Vol. 75, Issue 4, p. WA27-WA41
  • DOI: 10.1190/1.3475513

What is principal component analysis?
journal, March 2008


Understanding Mixup Training Methods
journal, January 2018


How can Big Data and machine learning benefit environment and water management: a survey of methods, applications, and future directions
journal, July 2019


Neural network modeling of in situ fluid-filled pore size distributions in subsurface shale reservoirs under data constraints
journal, March 2019


Comparison of invasive and non-invasive electromagnetic methods in soil water content estimation of a dike model
journal, April 2009

  • Preko, Kwasi; Scheuermann, Alexander; Wilhelm, Helmut
  • Journal of Geophysics and Engineering, Vol. 6, Issue 2
  • DOI: 10.1088/1742-2132/6/2/006

A Survey on Deep Transfer Learning
book, January 2018


mpi4py: Status Update After 12 Years of Development
journal, July 2021


Shale gas and non-aqueous fracturing fluids: Opportunities and challenges for supercritical CO2
journal, June 2015


Geophysical Signatures of Microbial Activity at Hydrocarbon Contaminated Sites: A Review
journal, November 2009


A Bayesian approach for inverse modeling, data assimilation, and conditional simulation of spatial random fields
journal, October 2010

  • Rubin, Yoram; Chen, Xingyuan; Murakami, Haruko
  • Water Resources Research, Vol. 46, Issue 10
  • DOI: 10.1029/2009WR008799

The Data Assimilation Research Testbed: A Community Facility
journal, September 2009

  • Anderson, Jeffrey; Hoar, Tim; Raeder, Kevin
  • Bulletin of the American Meteorological Society, Vol. 90, Issue 9
  • DOI: 10.1175/2009BAMS2618.1

PFLOTRAN: Reactive Flow & Transport Code for Use on Laptops to Leadership-Class Supercomputers
book, March 2012


Understanding hydraulic fracturing: a multi-scale problem
journal, October 2016

  • Hyman, J. D.; Jiménez-Martínez, J.; Viswanathan, H. S.
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 374, Issue 2078
  • DOI: 10.1098/rsta.2015.0426

Inverse problems: A Bayesian perspective
journal, May 2010


Multiple-point geostatistics for modeling subsurface heterogeneity: A comprehensive review: REVIEW OF MULTIPLE POI
journal, November 2008


A System Model for Geologic Sequestration of Carbon Dioxide
journal, February 2009

  • Stauffer, Philip H.; Viswanathan, Hari S.; Pawar, Rajesh J.
  • Environmental Science & Technology, Vol. 43, Issue 3
  • DOI: 10.1021/es800403w

Machine learning in geo- and environmental sciences: From small to large scale
journal, August 2020


Principal component analysis
journal, January 2014


PFLOTRAN-E4D: A parallel open source PFLOTRAN module for simulating time-lapse electrical resistivity data
journal, February 2017


Machine learning for hydrologic sciences: An introductory overview
journal, May 2021


The cross-scale science of CO2 capture and storage: from pore scale to regional scale
journal, January 2012

  • Middleton, Richard S.; Keating, Gordon N.; Stauffer, Philip H.
  • Energy & Environmental Science, Vol. 5, Issue 6
  • DOI: 10.1039/c2ee03227a

Sequential geophysical and flow inversion to characterize fracture networks in subsurface systems: MUDUNURU
journal, September 2017

  • Mudunuru, Maruti Kumar; Karra, Satish; Makedonska, Nataliia
  • Statistical Analysis and Data Mining: The ASA Data Science Journal, Vol. 10, Issue 5
  • DOI: 10.1002/sam.11356

Unsupervised machine learning based on non-negative tensor factorization for analyzing reactive-mixing
journal, October 2019


3-D Geologic Controls of Hydrothermal Fluid Flow at Brady geothermal field, Nevada, USA
journal, July 2021


Regression-based reduced-order models to predict transient thermal output for enhanced geothermal systems
journal, November 2017


A survey on Image Data Augmentation for Deep Learning
journal, July 2019