Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Applied Machine-Learning Models to Identify Spectral Sub-Types of M Dwarfs from Photometric Surveys

Journal Article · · Publications of the Astronomical Society of the Pacific
M dwarfs are the most abundant stars in the Solar Neighborhood and they are prime targets for searching for rocky planets in habitable zones. Consequently, a detailed characterization of these stars is in demand. The spectral sub-type is one of the parameters that is used for the characterization and it is traditionally derived from the observed spectra. However, obtaining the spectra of M dwarfs is expensive in terms of observation time and resources due to their intrinsic faintness. We study the performance of four machine-learning (ML) models—K-Nearest Neighbor (KNN), Random Forest (RF), Probabilistic Random Forest (PRF), and Multilayer Perceptron (MLP)—in identifying the spectral sub-types of M dwarfs at a grand scale by deploying broadband photometry in the optical and near-infrared. We trained the ML models by using the spectroscopically identified M dwarfs from the Sloan Digital Sky Survey (SDSS) Data Release (DR) 7, together with their photometric colors that were derived from the SDSS, Two-Micron All-Sky Survey, and Wide-field Infrared Survey Explorer. We found that the RF, PRF, and MLP give a comparable prediction accuracy, 74%, while the KNN provides slightly lower accuracy, 71%. We also found that these models can predict the spectral sub-type of M dwarfs with ~99% accuracy within ±1 sub-type. The five most useful features for the prediction are r - z, r - i, r - J, r - H , and g - z, and hence lacking data in all SDSS bands substantially reduces the prediction accuracy. However, we can achieve an accuracy of over 70% when the r and i magnitudes are available. Since the stars in this study are nearby (d ≲ 1300 pc for 95% of the stars), the dust extinction can reduce the prediction accuracy by only 3%. Finally, we used our optimized RF models to predict the spectral sub-types of M dwarfs from the Catalog of Cool Dwarf Targets for the Transiting Exoplanet Survey Satellite, and we provide the optimized RF models for public use.
Research Organization:
US Department of Energy (USDOE), Washington, DC (United States). Office of Science, Sloan Digital Sky Survey (SDSS)
Sponsoring Organization:
National Aeronautics and Space Administration (NASA); National Science Foundation (NSF); USDOE
OSTI ID:
2425105
Journal Information:
Publications of the Astronomical Society of the Pacific, Journal Name: Publications of the Astronomical Society of the Pacific Journal Issue: 1046 Vol. 135; ISSN 0004-6280
Publisher:
Astronomical Society of the Pacific (ASP)Copyright Statement
Country of Publication:
United States
Language:
English

References (33)

Learning from imbalanced data: open challenges and future directions journal April 2016
Array programming with NumPy journal September 2020
CARMENES input catalogue of M dwarfs: I. Low-resolution spectroscopy with CAFOS⋆ journal May 2015
The Gaia mission journal November 2016
Exploiting the Gaia EDR3 photometry to derive stellar temperatures journal September 2021
Gaia Data Release 3 journal June 2023
The SIMBAD astronomical database: The CDS reference database for astronomical objects journal April 2000
The Sloan Digital Sky Survey: Technical Summary journal September 2000
Maps of Dust Infrared Emission for Use in Estimation of Reddening and Cosmic Microwave Background Radiation Foregrounds journal June 1998
The Solar Neighborhood. XVII. Parallax Results from the CTIOPI 0.9 m Program: 20 New Members of the RECONS 10 Parsec Sample journal January 2006
Stellar SEDs from 0.3 to 2.5 μm: Tracing the Stellar Locus and Searching for Color Outliers in the SDSS and 2MASS journal November 2007
The Luminosity and mass Functions of Low-Mass Stars in the Galactic Disk. ii. the Field journal May 2010
The Sloan Digital sky Survey data Release 7 Spectroscopic m Dwarf Catalog. i. data journal February 2011
Measuring Reddening with Sloan Digital sky Survey Stellar Spectra and Recalibrating sfd journal August 2011
THE Pan-STARRS1 PHOTOMETRIC SYSTEM journal April 2012
THE FALSE POSITIVE RATE OF KEPLER AND THE OCCURRENCE OF PLANETS journal March 2013
How to Constrain your m Dwarf: Measuring Effective Temperature, Bolometric Luminosity, Mass, and Radius journal May 2015
TERRESTRIAL PLANET OCCURRENCE RATES FOR THE KEPLER GK DWARF SAMPLE journal August 2015
The Synthetic-Oversampling Method: Using Photometric Colors to Discover Extremely Metal-Poor Stars journal September 2015
The Seventh data Release of the Sloan Digital sky Survey journal May 2009
A million binaries from Gaia eDR3: sample selection and validation of Gaia parallax uncertainties journal February 2021
Galactic reddening in 3D from stellar photometry – an improved map journal June 2018
Kepler Planet-Detection Mission: Introduction and First Results journal January 2010
ERRATUM: “HOW TO CONSTRAIN YOUR M DWARF: MEASURING EFFECTIVE TEMPERATURE, BOLOMETRIC LUMINOSITY, MASS, AND RADIUS” (ApJ, 804, 64) journal March 2016
A Catalog of Cool Dwarf Targets for the Transiting Exoplanet Survey Satellite journal April 2018
Probabilistic Random Forest: A Machine Learning Algorithm for Noisy Data Sets journal December 2018
The Solar Neighborhood. XLV. The Stellar Multiplicity Rate of M Dwarfs Within 25 pc journal May 2019
Kepler Planet Occurrence Rates for Mid-type M Dwarfs as a Function of Spectral Type journal July 2019
Estimating Distances from Parallaxes. V. Geometric and Photogeometric Distances to 1.47 Billion Stars in Gaia Early Data Release 3 journal February 2021
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019
Scaling K2. I. Revised Parameters for 222,088 K2 Stars and a K2 Planet Radius Valley at 1.9 R journal March 2020
Applying Random Forest Classification to Ultracool Dwarf Discovery in Deep Surveys. I. Color Classification with SDSS, UKIDSS, and WISE Photometry journal April 2022
Applying Random Forest Classification to Ultracool Dwarf Discovery in Deep Surveys. II. Color Classification with PanSTARRS, 2MASS, UKIDSS, and WISE Photometry journal April 2022

Similar Records

Machine-learning Classifiers for Intermediate Redshift Emission-line Galaxies
Journal Article · Thu Sep 19 20:00:00 EDT 2019 · The Astrophysical Journal (Online) · OSTI ID:1650076

MULTI-WAVELENGTH CHARACTERIZATION OF STELLAR FLARES ON LOW-MASS STARS USING SDSS AND 2MASS TIME-DOMAIN SURVEYS
Journal Article · Tue Mar 20 00:00:00 EDT 2012 · Astrophysical Journal · OSTI ID:22016140