DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Astronomaly at scale: searching for anomalies amongst 4 million galaxies

Journal Article · · Monthly Notices of the Royal Astronomical Society

ABSTRACT Modern astronomical surveys are producing data sets of unprecedented size and richness, increasing the potential for high-impact scientific discovery. This possibility, coupled with the challenge of exploring a large number of sources, has led to the development of novel machine-learning-based anomaly detection approaches, such as astronomaly. For the first time, we test the scalability of astronomaly by applying it to almost 4 million images of galaxies from the Dark Energy Camera Legacy Survey. We use a trained deep learning algorithm to learn useful representations of the images and pass these to the anomaly detection algorithm isolation forest, coupled with astronomaly’s active learning method, to discover interesting sources. We find that data selection criteria have a significant impact on the trade-off between finding rare sources such as strong lenses and introducing artefacts into the data set. We demonstrate that active learning is required to identify the most interesting sources and reduce artefacts, while anomaly detection methods alone are insufficient. Using astronomaly, we find 1635 anomalies among the top 2000 sources in the data set after applying active learning, including eight strong gravitational lens candidates, 1609 galaxy merger candidates, and 18 previously unidentified sources exhibiting highly unusual morphology. Our results show that by leveraging the human–machine interface, astronomaly is able to rapidly identify sources of scientific interest even in large data sets.

Sponsoring Organization:
USDOE
OSTI ID:
2318767
Journal Information:
Monthly Notices of the Royal Astronomical Society, Journal Name: Monthly Notices of the Royal Astronomical Society Vol. 529 Journal Issue: 1; ISSN 0035-8711
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (53)

The Sloan Lens ACS Survey. XIII. Discovery of 40 New Galaxy-scale Strong Lenses journal December 2017
The SIMBAD astronomical database: The CDS reference database for astronomical objects journal April 2000
The Astropy Project: Building an Open-science Project and Status of the v2.0 Core Package journal August 2018
A Catalog of Morphologically Identified Merging Galaxies journal March 2009
LIII. On lines and planes of closest fit to systems of points in space journal November 1901
A Tutorial on Principal Component Analysis preprint January 2014
Morphological classification of astronomical images with limited labelling preprint January 2021
Systematic serendipity: a test of unsupervised machine learning as a method for anomaly detection journal December 2018
The Discovery of Quasi-stellar Objects behind M31 and M33 journal May 2019
The Asiago Supernova Catalogue - 10 years after journal November 1999
The NRAO VLA Sky Survey journal May 1998
The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra journal June 2020
Zoobot: Adaptable Deep Learning Models for Galaxy Morphology journal May 2023
The Million Quasars (Milliquas) Catalogue, v8 text January 2023
Anomaly detection in Hyper Suprime-Cam galaxy images with generative adversarial networks journal September 2021
Learning to Detect Interesting Anomalies preprint January 2022
Automated novelty detection in the WISE survey with one-class support vector machines journal October 2017
Galaxy groups in the 2dFGRS: the group-finding algorithm and the 2PIGG catalogue journal March 2004
Automated supervised classification of variable stars: I. Methodology journal September 2007
Galaxy Zoo: probabilistic morphology through Bayesian CNNs and active learning journal October 2019
Compact Groups of Galaxies in Sloan Digital Sky Survey and LAMOST Spectral Survey. I. The Catalogs journal January 2020
Discovering New Strong Gravitational Lenses in the DESI Legacy Imaging Surveys journal March 2021
Galaxy Zoo 1: data release of morphological classifications for nearly 900 000 galaxies★: Galaxy Zoo journal November 2010
The strong gravitational lens finding challenge journal May 2019
Harnessing the Hubble Space Telescope Archives: A Catalog of 21,926 Interacting Galaxies journal May 2023
Astropy: A community Python package for astronomy journal September 2013
The Sloan Digital Sky Survey: Technical Summary journal September 2000
Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies journal September 2021
The Eighteenth Data Release of the Sloan Digital Sky Surveys: Targeting and First Spectra from SDSS-V journal August 2023
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction preprint January 2018
The Two Micron All Sky Survey (2MASS) journal February 2006
ANNz2: Photometric Redshift and Probability Distribution Function Estimation using Machine Learning journal August 2016
Euclid preparation journal June 2022
The DES Bright Arcs Survey: Hundreds of Candidate Strongly Lensed Galaxy Systems from the Dark Energy Survey Science Verification and Year 1 Observations journal September 2017
The 6dF Galaxy Survey: final redshift release (DR3) and southern large-scale structures journal October 2009
The Galaxy Evolution Explorer : A Space Ultraviolet Survey Mission journal January 2005
Practical galaxy morphology tools from deep supervised representation learning journal February 2022
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019
GLADE: A galaxy catalogue for multimessenger searches in the advanced gravitational-wave detector era journal June 2018
An Extended Catalog of Galaxy–Galaxy Strong Gravitational Lenses Discovered in DES Using Convolutional Neural Networks journal July 2019
Clustering of LRGs in the DECaLS DR8 Footprint: Distance Constraints from Baryon Acoustic Oscillations Using Photometric Redshifts journal November 2020
The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package* journal August 2022
Overview of the DESI Legacy Imaging Surveys journal April 2019
DeepMerge – II. Building robust deep learning algorithms for merging galaxy identification across domains journal June 2021
Self-supervised Learning for Astronomical Image Classification conference January 2021
LOF: identifying density-based local outliers journal June 2000
The SAGA Survey. II. Building a Statistical Sample of Satellite Systems around Milky Way–like Galaxies journal February 2021
The weirdest SDSS galaxies: results from an outlier detection algorithm journal November 2016
Finding Strong Gravitational Lenses in the DESI DECam Legacy Survey journal May 2020
The Wide-Field Infrared Survey Explorer (Wise): Mission Description and Initial On-Orbit Performance journal November 2010
Astronomaly: Personalised active anomaly detection in astronomical data journal July 2021
Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey journal September 2008
Surface Brightness and Evolution of Galaxies journal December 1976

Related Subjects