skip to main content
DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

This content will become publicly available on February 13, 2020

Title: Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico

Abstract

In this paper, we present an approach based on machine learning (ML) to distinguish eruption and precursory signals of Chimayó geyser (New Mexico, U.S.A.) under noisy environmental conditions. This geyser can be considered a natural analog of CO 2 intrusion into shallow water aquifers. By studying this geyser, we can understand upwelling of CO 2-rich fluids from depth, which has relevance to leak monitoring in a CO 2 sequestration project. ML methods such as random forests (RFs) are known to be robust multiclass classifiers and perform well under unfavorable, noisy conditions. However, the extent of the RF method’s accuracy is poorly understood for this CO 2-driven geysering application. The current study aims to quantify the performance of RF classifiers to discern the geyser state. Toward this goal, we first present the data collected from the seismometer that is installed near the Chimayó geyser. The seismic signals collected at this site contain different types of noises such as daily temperature variations, animal movement near the geyser, and human activity. First, we filter the signals from these noises by combining the Butterworth high-pass (BH) filter and an autoregressive (AR) method in a multilevel fashion. We show that by combining these filtering techniquesmore » in a hierarchical fashion leads to a reduction in noise in the seismic data without removing the precursors and eruption event signals. We then use RF on the filtered data to classify the state of geyser into three classes: remnant noise, precursor, and eruption states. RF classifier is constructed based on the comprehensive features extracted using the Tsfresh Python package. We show that the classification accuracy using RF on the filtered data is greater than 90%. We also evaluate the accuracy of other classical time-series methods such as dynamic time warping (DTW) on filtered data along with RF on partially filtered data in which we remove the daily trends. Classification accuracy shows that DTW performs poorly (44%) and RF on partially filtered data performs decently (87%). Denoising seismic signals from both daily trends and human activity enhances RF classifier performance by 7%. Finally, these aspects make the proposed ML framework attractive for event discrimination and signal enhancement under noisy conditions, with strong potential for application to monitor leaks in CO 2 sequestration.« less

Authors:
 [1];  [2]; ORCiD logo [3]; ORCiD logo [3]; ORCiD logo [3]; ORCiD logo [3]; ORCiD logo [3]; ORCiD logo [3]; ORCiD logo [3];  [3]; ORCiD logo [3]
  1. Univ. of California, Los Angeles, CA (United States)
  2. Columbia Univ., Palisades, NY (United States)
  3. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Publication Date:
Research Org.:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE Laboratory Directed Research and Development (LDRD) Program
OSTI Identifier:
1544684
Report Number(s):
LA-UR-18-29261
Journal ID: ISSN 0895-0695
Grant/Contract Number:  
89233218CNA000001
Resource Type:
Accepted Manuscript
Journal Name:
Seismological Research Letters
Additional Journal Information:
Journal Volume: 90; Journal Issue: 2A; Journal ID: ISSN 0895-0695
Publisher:
Seismological Society of America
Country of Publication:
United States
Language:
English
Subject:
58 GEOSCIENCES; 97 MATHEMATICS AND COMPUTING; Computer Science; Earth Sciences; Energy Sciences; Information Science; Mathematics; CO2-driven cold-water geysers; carbon sequestration; monitoring; seismicity; signal processing; precursor; eruption; machine learning; event classification; feature extraction; random forests

Citation Formats

Yuan, Baichuan, Tan, Yen Joe, Mudunuru, Maruti Kumar, Marcillo, Omar Eduardo, Delorey, Andrew A., Roberts, Peter Morse, Webster, Jeremy David, Gammans, Christine Naomi Louise, Karra, Satish, Guthrie, George Drake, and Johnson, Paul Allan. Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico. United States: N. p., 2019. Web. doi:10.1785/0220180306.
Yuan, Baichuan, Tan, Yen Joe, Mudunuru, Maruti Kumar, Marcillo, Omar Eduardo, Delorey, Andrew A., Roberts, Peter Morse, Webster, Jeremy David, Gammans, Christine Naomi Louise, Karra, Satish, Guthrie, George Drake, & Johnson, Paul Allan. Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico. United States. doi:10.1785/0220180306.
Yuan, Baichuan, Tan, Yen Joe, Mudunuru, Maruti Kumar, Marcillo, Omar Eduardo, Delorey, Andrew A., Roberts, Peter Morse, Webster, Jeremy David, Gammans, Christine Naomi Louise, Karra, Satish, Guthrie, George Drake, and Johnson, Paul Allan. Wed . "Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico". United States. doi:10.1785/0220180306.
@article{osti_1544684,
title = {Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico},
author = {Yuan, Baichuan and Tan, Yen Joe and Mudunuru, Maruti Kumar and Marcillo, Omar Eduardo and Delorey, Andrew A. and Roberts, Peter Morse and Webster, Jeremy David and Gammans, Christine Naomi Louise and Karra, Satish and Guthrie, George Drake and Johnson, Paul Allan},
abstractNote = {In this paper, we present an approach based on machine learning (ML) to distinguish eruption and precursory signals of Chimayó geyser (New Mexico, U.S.A.) under noisy environmental conditions. This geyser can be considered a natural analog of CO2 intrusion into shallow water aquifers. By studying this geyser, we can understand upwelling of CO2-rich fluids from depth, which has relevance to leak monitoring in a CO2 sequestration project. ML methods such as random forests (RFs) are known to be robust multiclass classifiers and perform well under unfavorable, noisy conditions. However, the extent of the RF method’s accuracy is poorly understood for this CO2-driven geysering application. The current study aims to quantify the performance of RF classifiers to discern the geyser state. Toward this goal, we first present the data collected from the seismometer that is installed near the Chimayó geyser. The seismic signals collected at this site contain different types of noises such as daily temperature variations, animal movement near the geyser, and human activity. First, we filter the signals from these noises by combining the Butterworth high-pass (BH) filter and an autoregressive (AR) method in a multilevel fashion. We show that by combining these filtering techniques in a hierarchical fashion leads to a reduction in noise in the seismic data without removing the precursors and eruption event signals. We then use RF on the filtered data to classify the state of geyser into three classes: remnant noise, precursor, and eruption states. RF classifier is constructed based on the comprehensive features extracted using the Tsfresh Python package. We show that the classification accuracy using RF on the filtered data is greater than 90%. We also evaluate the accuracy of other classical time-series methods such as dynamic time warping (DTW) on filtered data along with RF on partially filtered data in which we remove the daily trends. Classification accuracy shows that DTW performs poorly (44%) and RF on partially filtered data performs decently (87%). Denoising seismic signals from both daily trends and human activity enhances RF classifier performance by 7%. Finally, these aspects make the proposed ML framework attractive for event discrimination and signal enhancement under noisy conditions, with strong potential for application to monitor leaks in CO2 sequestration.},
doi = {10.1785/0220180306},
journal = {Seismological Research Letters},
number = 2A,
volume = 90,
place = {United States},
year = {2019},
month = {2}
}

Journal Article:
Free Publicly Available Full Text
This content will become publicly available on February 13, 2020
Publisher's Version of Record

Save / Share: