Cluster, classify, regress: A general method for learning discontinuous functions
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Fusion Energy Division
- Univ. of Manchester (United Kingdom). Dept. of Mathematics
This paper presents a method for solving the supervised learning problem in which the output is highly nonlinear and discontinuous. Here, it is proposed to solve this problem in three stages: (i) cluster the pairs of input-output data points, resulting in a label for each point; (ii) classify the data, where the corresponding label is the output; and finally (iii) perform one separate regression for each class, where the training data corresponds to the subset of the original input-output pairs which have that label according to the classifier. It has not yet been proposed to combine these 3 fundamental building blocks of machine learning in this simple and powerful fashion. This can be viewed as a form of deep learning, where any of the intermediate layers can itself be deep. The utility and robustness of the methodology is illustrated on some toy problems, including one example problem arising from simulation of plasma fusion in a tokamak.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Fusion Energy Sciences (FES); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1648947
- Journal Information:
- Foundations of Data Science, Vol. 1, Issue 4; ISSN 2639-8001
- Publisher:
- AIMSCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Computer Model Calibration with Time Series Data Using Deep Learning and Quantile Regression
Process Anomaly Detection for Sparsely Labeled Events in Nuclear Power Plants