DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Using feature importance as an exploratory data analysis tool on Earth system models

Journal Article · · Geoscientific Model Development (Online)

Abstract. Machine learning (ML) models are commonly used to generate predictions, but these models can also support the discovery of new science. Generating accurate predictions necessitates that a model captures the structure of the underlying data. If the structure is properly extracted, ML could be a useful exploratory and evidential tool. In this paper, we present a case study that demonstrates the use of ML for exploratory data analysis (EDA) in the climate space. We apply the ML explainability method of spatiotemporal zeroed feature importance (stZFI) to understand how climate-variable associations evolve over space and time. Our analyses focus on data from ensembles of Earth system models (ESMs) which provide data on different climate states and conditions. We elect to work with ESM ensembles since they allow us to compare feature importance across alternative scenarios not available with observed data. The ensembles also account for natural variability so that we can distinguish between signal and noise due to natural climate variability when computing feature importance. The use of perturbed initial condition ensembles introduces variability mimicking the natural variability in the atmosphere; thus the signals emerging using feature importance (FI) can be evaluated against the natural variability in the climate system. For our analyses, we consider the 1991 volcanic eruption of Mount Pinatubo, which was a large stratospheric aerosol injection. We explore the climate pathway associated with the eruption from aerosols to radiation to temperature at both the near-surface and stratospheric levels. In addition to applying the method to data generated from two different ESMs, we apply stZFI to reanalysis data to compare the associations identified by stZFI. We show how stZFI tracks the importance of aerosol optical depth over time on forecasting temperatures. This case study illustrates usefulness of an ML tool (stZFI) for EDA on a well-studied climate exemplar.

Sponsoring Organization:
USDOE
Grant/Contract Number:
NA0003525
OSTI ID:
2522840
Journal Information:
Geoscientific Model Development (Online), Journal Name: Geoscientific Model Development (Online) Journal Issue: 4 Vol. 18; ISSN 1991-9603
Publisher:
Copernicus GmbHCopyright Statement
Country of Publication:
Germany
Language:
English

References (49)

The Impact of Mount Pinatubo on World-Wide Temperatures journal May 1996
The Brewer-Dobson circulation journal June 2014
Stratospheric dynamics and midlatitude jets under geoengineering with space mirrors and sulfate and titania aerosols journal January 2015
Geoengineering with stratospheric aerosols: What do we not know after a decade of research?: GEOENGINEERING: WHAT DO WE NOT KNOW? journal November 2016
Deep echo state networks with uncertainty quantification for spatio‐temporal forecasting journal November 2018
The DOE E3SM Model Version 2: Overview of the physical model preprint April 2022
Characterizing climate pathways using feature importance on echo state networks
  • Goode, Katherine; Ries, Daniel; McClernon, Kellie
  • Statistical Analysis and Data Mining: The ASA Data Science Journal, Vol. 17, Issue 4 https://doi.org/10.1002/sam.11706
journal July 2024
An ensemble quadratic echo state network for non‐linear spatio‐temporal forecasting journal January 2017
An overview of the Earth system science of solar geoengineering: Overview of the earth system science of solar geoengineering journal July 2016
Gradient-Based Attribution Methods book January 2019
A Practical Guide to Applying Echo State Networks book January 2012
Evaluating time series forecasting models: an empirical study on performance estimation methods journal October 2020
Unrestricted permutation forces extrapolation: variable importance requires at least one more model, or there is no free variable importance journal October 2021
Aerosols, clouds and radiation journal January 1991
The effect of aerosols on long wave radiation and global warming journal January 2014
Reservoir computing approaches to recurrent neural network training journal August 2009
A comparison of model validation approaches for echo state networks using climate model replicates journal March 2024
Re-evaluation of SO 2 release of the 15 June 1991 Pinatubo eruption using ultraviolet and infrared satellite sensors : SATELLITE STUDY OF 15 JUNE 1991 PINATUBO SO journal April 2004
An Overview of the Atmospheric Component of the Energy Exascale Earth System Model journal August 2019
Physically Interpretable Neural Networks for the Geosciences: Applications to Earth System Variability journal August 2020
Using an Explainable Machine Learning Approach to Characterize Earth System Model Errors: Application of SHAP Analysis to Modeling Lightning Flash Occurrence journal April 2022
Impact of the Latitude of Stratospheric Aerosol Injection on the Southern Annular Mode journal September 2022
Explainable Artificial Intelligence for Bayesian Neural Networks: Toward Trustworthy Predictions of Ocean Dynamics journal October 2022
Using Explainable Artificial Intelligence to Quantify “Climate Distinguishability” After Stratospheric Aerosol Injection journal October 2023
Stratospheric temperature increases due to Pinatubo aerosols journal January 1992
Stratospheric aerosol optical depths, 1850–1990 journal January 1993
Atmospheric effects of the Mt Pinatubo eruption journal February 1995
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead journal May 2019
Key impacts of climate engineering on biodiversity and ecosystems, with priorities for future research journal March 2016
The effect of various methodological options on the detection of leading modes of sea level pressure variability journal January 2006
The 1997–98 El Niño Evolution Relative to Previous El Niño Events journal January 2000
A Proposal for the Intercomparison of the Dynamical Cores of Atmospheric General Circulation Models journal October 1994
A Comparison of Semi-Lagrangian and Eulerian Tropical Climate Simulations journal April 1998
Investigating the Fidelity of Explainable Artificial Intelligence Methods for Applications of Convolutional Neural Networks in Geoscience journal October 2022
How Well Do Coupled Models Simulate Today's Climate? journal March 2008
Making the Black Box More Transparent: Understanding the Physical Implications of Machine Learning journal November 2019
Time-Varying Climate Sensitivity from Regional Feedbacks journal July 2013
The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2) journal July 2017
Quantifying Uncertainty in Projections of Regional Climate Change: A Bayesian Approach to the Analysis of Multimodel Ensembles journal May 2005
Bayesian Modeling of Uncertainty in Ensembles of Climate Models journal March 2009
Solving High-Dimensional Inverse Problems with Auxiliary Uncertainty via Operator Learning with Limited data journal January 2023
Robust winter warming over Eurasia under stratospheric sulfate geoengineering – the role of stratospheric dynamics journal January 2021
Retrieval of volcanic SO2 from HIRS/2 using optimal estimation journal July 2017
Localized injections of interactive volcanic aerosols and their climate impacts in a simple general circulation model preprint March 2024
Machine learning for numerical weather and climate modelling: a review journal November 2023
Validating a microphysical prognostic stratospheric aerosol implementation in E3SMv2 using observations after the Mount Pinatubo eruption journal July 2024
HSW-V v1.0: localized injections of interactive volcanic aerosols and their climate impacts in a simple general circulation model journal August 2024
Using feature importance as an exploratory data analysis tool on Earth system models journal February 2025
Evaluating statistical consistency in the ocean model component of the Community Earth System Model (pyCECT v2.0) journal January 2016

Similar Records

Related Subjects