Data augmentation for disruption prediction via robust surrogate models

Rath, Katharina; Rügamer, David; Bischl, Bernd; von Toussaint, Udo; Rea, Cristina; Maris, Andrew; Granetz, Robert; Albert, Christopher G.

doi:10.7910/DVN/FMJCAD

Dataset
Other Related Research

Abstract

The goal of this work is to generate large, statistically representative datasets for training machine learning models for disruption prediction, starting from the data of only a few existing discharges. Such a comprehensive training database is important for achieving satisfactory and reliable prediction results with artificial neural network classifiers. Here, we aim for a robust augmentation of the training database for multivariate time series data using Student-t process regression. We apply Student-t process regression in a state space formulation via Bayesian filtering to tackle the challenges imposed by outliers and noise in the training dataset and to reduce the computational complexity. The method can therefore also be used when the time resolution is high. We use an uncorrelated model for each dimension and impose correlations afterwards via coloring transformations. We demonstrate the efficacy of our approach on plasma diagnostics data of three different disruption classes from the DIII-D tokamak. To evaluate whether the distribution of the generated data is similar to that of the training data, we additionally perform statistical analyses using methods from time series analysis, descriptive statistics, and classic machine learning clustering algorithms.
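
The coloring step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a Cholesky factor as the coloring matrix and Gaussian white noise as a stand-in for the uncorrelated per-dimension samples; the record does not state which factorization the authors use, and the target covariance and all function names below are hypothetical.

```python
import math
import random


def cholesky(cov):
    """Return lower-triangular L with L L^T == cov (cov symmetric positive definite)."""
    n = len(cov)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(cov[i][i] - s)
            else:
                L[i][j] = (cov[i][j] - s) / L[j][j]
    return L


def color(white, L):
    """Map one uncorrelated (white) vector z to a correlated one, y = L z."""
    return [sum(L[i][k] * white[k] for k in range(i + 1)) for i in range(len(L))]


def sample_covariance(rows):
    """Unbiased sample covariance matrix of a list of equally sized vectors."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[sum((r[a] - mean[a]) * (r[b] - mean[b]) for r in rows) / (n - 1)
             for b in range(d)] for a in range(d)]


# Hypothetical target covariance between two signals (not taken from the dataset).
target_cov = [[1.0, 0.8], [0.8, 2.0]]
L = cholesky(target_cov)

# Assumption: unit-variance Gaussian draws stand in for the independent
# per-dimension model outputs described in the abstract.
rng = random.Random(0)
white = [[rng.gauss(0.0, 1.0) for _ in range(2)] for _ in range(20000)]
colored = [color(z, L) for z in white]
estimated_cov = sample_covariance(colored)
```

In this sketch, `estimated_cov` should approach `target_cov` as the number of samples grows, because the covariance of `L z` is `L L^T` when `z` has identity covariance.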

OSTI

Publication Date: 2022-06-06

DOE Contract Number: SC0014264; FC02-04ER54698

Research Org.: Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States). Plasma Science and Fusion Center; General Atomics, San Diego, CA (United States)

Sponsoring Org.: USDOE Office of Science (SC), Fusion Energy Sciences (FES)

Subject: 97 MATHEMATICS AND COMPUTING

OSTI Identifier: 1887951

DOI: https://doi.org/10.7910/DVN/FMJCAD

Citation Formats


Rath, Katharina, Rügamer, David, Bischl, Bernd, von Toussaint, Udo, Rea, Cristina, Maris, Andrew, Granetz, Robert, and Albert, Christopher G. Data augmentation for disruption prediction via robust surrogate models. United States: N. p., 2022. Web. doi:10.7910/DVN/FMJCAD.



Rath, Katharina, Rügamer, David, Bischl, Bernd, von Toussaint, Udo, Rea, Cristina, Maris, Andrew, Granetz, Robert, & Albert, Christopher G. Data augmentation for disruption prediction via robust surrogate models. United States. https://doi.org/10.7910/DVN/FMJCAD



Rath, Katharina, Rügamer, David, Bischl, Bernd, von Toussaint, Udo, Rea, Cristina, Maris, Andrew, Granetz, Robert, and Albert, Christopher G. 2022. "Data augmentation for disruption prediction via robust surrogate models". United States. https://doi.org/10.7910/DVN/FMJCAD. https://www.osti.gov/servlets/purl/1887951. Pub date: 2022-06-06.



                    
@article{osti_1887951,
  title        = {Data augmentation for disruption prediction via robust surrogate models},
  author       = {Rath, Katharina and Rügamer, David and Bischl, Bernd and von Toussaint, Udo and Rea, Cristina and Maris, Andrew and Granetz, Robert and Albert, Christopher G.},
  abstractNote = {The goal of this work is to generate large, statistically representative datasets for training machine learning models for disruption prediction, starting from the data of only a few existing discharges. Such a comprehensive training database is important for achieving satisfactory and reliable prediction results with artificial neural network classifiers. Here, we aim for a robust augmentation of the training database for multivariate time series data using Student-t process regression. We apply Student-t process regression in a state space formulation via Bayesian filtering to tackle the challenges imposed by outliers and noise in the training dataset and to reduce the computational complexity. The method can therefore also be used when the time resolution is high. We use an uncorrelated model for each dimension and impose correlations afterwards via coloring transformations. We demonstrate the efficacy of our approach on plasma diagnostics data of three different disruption classes from the DIII-D tokamak. To evaluate whether the distribution of the generated data is similar to that of the training data, we additionally perform statistical analyses using methods from time series analysis, descriptive statistics, and classic machine learning clustering algorithms.},
  doi          = {10.7910/DVN/FMJCAD},
  place        = {United States},
  year         = {2022},
  month        = {6}
}



Similar records in DOE Data Explorer and OSTI.GOV collections:

Data augmentation for disruption prediction via robust surrogate models

Journal Article Rath, Katharina ; Rügamer, David ; Bischl, Bernd ; ... - Journal of Plasma Physics

The goal of this work is to generate large statistically representative data sets to train machine learning models for disruption prediction provided by data from few existing discharges. Such a comprehensive training database is important to achieve satisfying and reliable prediction results in artificial neural network classifiers. Here, we aim for a robust augmentation of the training database for multivariate time series data using Student t process regression. We apply Student t process regression in a state space formulation via Bayesian filtering to tackle challenges imposed by outliers and noise in the training data set and to reduce the computational ...

An application of survival analysis to disruption prediction via Random Forests

Dataset Tinguely, R. A. ; Montes, K. J. ; Rea, C. ; ...

One of the most pressing challenges facing the fusion community is adequately mitigating or, even better, avoiding disruptions of tokamak plasmas. However, before this can be done, disruptions must first be predicted with sufficient warning time to actuate a response. The established field of survival analysis provides a convenient statistical framework for time-to-event (i.e. time-to-disruption) studies. This paper demonstrates the integration of an existing disruption prediction machine learning algorithm with the Kaplan-Meier estimator of survival probability. Specifically discussed are the implied warning times from binary classification of disruption databases and the interpretation of output signals from Random Forest algorithms trained ...

Parameter Calibration and Uncertainty Quantification via Surrogate Model Optimization for CFD-DEM Modelling of a Small-Scale Slugging Bed

Dataset Fullmer, W D ; Webber, J ; VanEssendelft, D

ARS TRS
Data for "Extended-MHD simulations of disruption mitigation via massive gas injection in SPARC"

Dataset Kleiner, Andreas ; Ferraro, Nathaniel ; Sweeney, Ryan ; ...

The data in this repository correspond to the figures of the paper "Extended-MHD simulations of disruption mitigation via massive gas injection in SPARC" published in the journal Nuclear Fusion. The figures are created based on simulations with the M3D-C1 code using Python and IDL post-processing tools.
Hybrid deep learning architecture for general disruption prediction across tokamaks

Dataset Zhu, J. X. ; Rea, C. ; Montes, K. ; ...

In this paper, we present a new deep learning disruption prediction algorithm based on important findings from explorative data analysis which effectively allows knowledge transfer from existing devices to new ones, thereby predicting disruptions using very limited disruptive data from the new devices. The explorative data analysis conducted via unsupervised clustering techniques confirms that time-sequence data are much better separators of disruptive and non-disruptive behavior than the instantaneous plasma state data with further advantageous implications for a sequence-based predictor. Based on such important findings, we have designed a new algorithm for multi-machine disruption prediction that achieves high predictive accuracy on ...