
ON MACHINE-LEARNED CLASSIFICATION OF VARIABLE STARS WITH SPARSE AND NOISY TIME-SERIES DATA

Journal Article · Astrophysical Journal

With the coming data deluge from synoptic surveys, there is a need for frameworks that can quickly and automatically produce calibrated classification probabilities for newly observed variables based on small numbers of time-series measurements. In this paper, we introduce a methodology for variable-star classification, drawing from modern machine-learning techniques. We describe how to homogenize the information gleaned from light curves by selection and computation of real-numbered metrics (features), detail methods to robustly estimate periodic features, introduce tree-ensemble methods for accurate variable-star classification, and show how to rigorously evaluate a classifier using cross validation. On a 25-class data set of 1542 well-studied variable stars, we achieve a 22.8% error rate using the random forest (RF) classifier; this represents a 24% improvement over the best previous classifier on these data. This methodology is effective for identifying samples of specific science classes: for pulsational variables used in Milky Way tomography we obtain a discovery efficiency of 98.2% and for eclipsing systems we find an efficiency of 99.1%, both at 95% purity. The RF classifier is superior to other methods in terms of accuracy, speed, and relative immunity to irrelevant features; the RF can also be used to estimate the importance of each feature in classification. Additionally, we present the first astronomical use of hierarchical classification methods to incorporate a known class taxonomy in the classifier, which reduces the catastrophic error rate from 8% to 7.8%. Excluding low-amplitude sources, the overall error rate improves to 14%, with a catastrophic error rate of 3.5%.
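The "efficiency at 95% purity" figures quoted above can be made concrete with a small sketch. It assumes the usual definitions (purity = fraction of selected sources that truly belong to the target class, efficiency = fraction of true members that are selected), which may differ in detail from the paper's own. The function name `efficiency_at_purity` and the toy scores are hypothetical, chosen only for illustration; the scores stand in for a classifier's per-source class probabilities.

```python
from typing import Sequence


def efficiency_at_purity(scores: Sequence[float],
                         is_member: Sequence[bool],
                         min_purity: float = 0.95) -> float:
    """Highest efficiency reachable by a score threshold with purity >= min_purity.

    scores:    classifier probability that each source is in the target class.
    is_member: True where the source really belongs to that class.
    """
    total_members = sum(is_member)
    if total_members == 0:
        raise ValueError("no true members of the target class")

    # Rank sources from most to least confident.
    ranked = sorted(zip(scores, is_member), key=lambda p: p[0], reverse=True)

    best_efficiency = 0.0
    true_pos = 0
    for i, (score, member) in enumerate(ranked):
        true_pos += member
        n_selected = i + 1
        # Only cut where a threshold can actually fall: at the end of the
        # list or where the next score is strictly lower (no split ties).
        at_cut = n_selected == len(ranked) or ranked[i + 1][0] < score
        if at_cut and true_pos / n_selected >= min_purity:
            best_efficiency = max(best_efficiency, true_pos / total_members)
    return best_efficiency


# Toy data (made up): six sources, four of which are true class members.
toy_scores = [0.99, 0.95, 0.90, 0.80, 0.60, 0.40]
toy_truth = [True, True, True, False, True, False]
print(efficiency_at_purity(toy_scores, toy_truth, min_purity=0.95))
```

In practice the scores would come from held-out (cross-validated) predictions rather than from the data the classifier was trained on, so that the reported efficiency is not optimistically biased.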

OSTI ID: 21576816
Journal Information: Astrophysical Journal, Vol. 733, Issue 1; ISSN 0004-637X; CODEN ASJOAB
Country of Publication: United States
Language: English

Similar Records

Automated classification of periodic variable stars detected by the wide-field infrared survey explorer
Journal Article · Tue Jul 01 00:00:00 EDT 2014 · Astronomical Journal (New York, N.Y. Online) · OSTI ID:22342291

CONSTRUCTION OF A CALIBRATED PROBABILISTIC CLASSIFICATION CATALOG: APPLICATION TO 50k VARIABLE SOURCES IN THE ALL-SKY AUTOMATED SURVEY
Journal Article · Fri Dec 14 23:00:00 EST 2012 · Astrophysical Journal, Supplement Series · OSTI ID:22089746

Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study Using CO2-Driven Cold-Water Geyser in Chimayó, New Mexico
Journal Article · Tue Feb 12 23:00:00 EST 2019 · Seismological Research Letters · OSTI ID:1544684