Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Fast and efficient identification of anomalous galaxy spectra with neural density estimation

Journal Article · · Monthly Notices of the Royal Astronomical Society

ABSTRACT

Current large-scale astrophysical experiments produce unprecedented amounts of rich and diverse data. This creates a growing need for fast and flexible automated data inspection methods. Deep learning algorithms can capture and pick up subtle variations in rich data sets and are fast to apply once trained. Here, we study the applicability of an unsupervised and probabilistic deep learning framework, the probabilistic auto-encoder, to the detection of peculiar objects in galaxy spectra from the SDSS survey. Different to supervised algorithms, this algorithm is not trained to detect a specific feature or type of anomaly, instead it learns the complex and diverse distribution of galaxy spectra from training data and identifies outliers with respect to the learned distribution. We find that the algorithm assigns consistently lower probabilities (higher anomaly score) to spectra that exhibit unusual features. For example, the majority of outliers among quiescent galaxies are E+A galaxies, whose spectra combine features from old and young stellar population. Other identified outliers include LINERs, supernovae, and overlapping objects. Conditional modelling further allows us to incorporate additional information. Namely, we evaluate the probability of an object being anomalous given a certain spectral class, but other information such as metrics of data quality or estimated redshift could be incorporated as well. We make our code publicly available.

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2008397
Journal Information:
Monthly Notices of the Royal Astronomical Society, Journal Name: Monthly Notices of the Royal Astronomical Society Journal Issue: 2 Vol. 526; ISSN 0035-8711
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (42)

Variational autoencoders for new physics mining at the Large Hadron Collider journal May 2019
Adversarially-trained autoencoders for robust unsupervised new physics searches journal October 2019
The Dawes Review 10: The impact of deep learning for the analysis of galaxy surveys journal January 2023
SciPy 1.0: fundamental algorithms for scientific computing in Python journal February 2020
The nature of the optical-radio correlations for powerful radio galaxies journal August 1998
An optical spectroscopic survey of the 3CR sample of radio galaxies with z $\mathsf{<}$ 0.3: II. Spectroscopic classes and accretion modes in radio-loud AGN journal January 2010
Second ROSAT all-sky survey (2RXS) source catalogue journal March 2016
A large sample of Kohonen selected E+A (post-starburst) galaxies from the Sloan Digital Sky Survey journal January 2017
Planck 2018 results: VI. Cosmological parameters journal September 2020
Optical characterization of WISE selected blazar candidates journal September 2019
The SIMBAD astronomical database: The CDS reference database for astronomical objects journal April 2000
Classification parameters for the emission-line spectra of extragalactic objects journal February 1981
Spectroscopy of galaxies in distant clusters. II - The population of the 3C 295 cluster journal July 1983
The Sloan Digital Sky Survey: Technical Summary journal September 2000
Color Separation of Galaxy Types in the Sloan Digital Sky Survey Imaging Data journal October 2001
Spectroscopic Target Selection in the Sloan Digital Sky Survey: The Quasar Sample journal June 2002
Spectroscopic Target Selection in the Sloan Digital Sky Survey: The Main Galaxy Sample journal September 2002
Spectroscopic Detection of Type Ia Supernovae in the Sloan Digital Sky Survey journal November 2003
Distributions of Galaxy Spectral Types in the Sloan Digital Sky Survey journal August 2004
The 2.5 m Telescope of the Sloan Digital Sky Survey journal April 2006
A Large, Uniform Sample of X-Ray-emitting Active Galactic Nuclei from the ROSAT All Sky and Sloan Digital Sky Surveys: The Data Release 5 Sample journal December 2006
A Large Sample of bl lac Objects from the sdss and First journal May 2008
Spectral Classification and Redshift Measurement for the Sdss-Iii Baryon Oscillation Spectroscopic Survey journal October 2012
RAPID: Early Classification of Explosive Transients Using Deep Learning journal September 2019
Evolution Through the Post-starburst Phase: Using Post-starburst Galaxies as Laboratories for Understanding the Processes that Drive Galaxy Evolution journal July 2021
Optical spectra of 3CR radio galaxies journal September 1979
Discovery of 90 Type Ia supernovae among 700 000 Sloan spectra: the Type Ia supernova rate versus galaxy mass and star formation rate at redshift ∼0.1 journal February 2013
Stellar velocity dispersions and emission line properties of SDSS-III/BOSS galaxies journal March 2013
The weirdest SDSS galaxies: results from an outlier detection algorithm journal November 2016
Searching for new physics with deep autoencoders journal April 2020
A Unifying Review of Deep and Shallow Anomaly Detection journal May 2021
The physical properties of star-forming galaxies in the low-redshift Universe journal July 2004
Semi-empirical analysis of Sloan Digital Sky Survey galaxies - II. The bimodality of the galaxy population revisited journal August 2006
A comprehensive classification of galaxies in the Sloan Digital Sky Survey: how to tell true from fake AGN?: How to tell true from fake AGN? journal March 2011
Evolution of the most massive galaxies to z= 0.6 - I. A new method for physical parameter estimation: Evolution of the most massive galaxies journal January 2012
Nuclear Activity in Nearby Galaxies journal September 2008
Overview of the DESI Legacy Imaging Surveys journal April 2019
Dimensionality Reduction of SDSS Spectra with Variational Autoencoders journal June 2020
Physical Drivers of Emission-line Diversity of SDSS Seyfert 2s and LINERs after Removal of Contributions from Star Formation journal November 2021
A Probabilistic Autoencoder for Type Ia Supernova Spectral Time Series journal August 2022
The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra journal June 2020
A Deep-learning Approach for Live Anomaly Detection of Extragalactic Transients journal August 2021

Similar Records

Where’s Swimmy?: Mining unique color features buried in galaxies by deep anomaly detection using Subaru Hyper Suprime-Cam data
Journal Article · Mon Nov 22 23:00:00 EST 2021 · Publications of the Astronomical Society of Japan · OSTI ID:1982719

Where’s Swimmy?: Mining unique color features buried in galaxies by deep anomaly detection using Subaru Hyper Suprime-Cam data
Journal Article · Mon Nov 22 23:00:00 EST 2021 · Publications of the Astronomical Society of Japan · OSTI ID:1833434

Automatic classification of time-variable X-ray sources
Journal Article · Thu May 01 00:00:00 EDT 2014 · Astrophysical Journal · OSTI ID:22357039

Related Subjects