DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: DeepAstroUDA: semi-supervised universal domain adaptation for cross-survey galaxy morphology classification and anomaly detection

Journal Article · · Machine Learning: Science and Technology

Abstract Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, DeepAstroUDA , as an approach to overcome this challenge. This algorithm performs semi-supervised domain adaptation (DA) and can be applied to datasets with different data distributions and class overlaps. Non-overlapping classes can be present in any of the two datasets (the labeled source domain, or the unlabeled target domain), and the method can even be used in the presence of unknown classes. We apply our method to three examples of galaxy morphology classification tasks of different complexities (three-class and ten-class problems), with anomaly detection: (1) datasets created after different numbers of observing years from a single survey (Legacy Survey of Space and Time mock data of one and ten years of observations); (2) data from different surveys (Sloan Digital Sky Survey (SDSS) and DECaLS); and (3) data from observing fields with different depths within one survey (wide field and Stripe 82 deep field of SDSS). For the first time, we demonstrate the successful use of DA between very discrepant observational datasets. DeepAstroUDA is capable of bridging the gap between two astronomical surveys, increasing classification accuracy in both domains (up to 40 % on the unlabeled data), and making model performance consistent across datasets. Furthermore, our method also performs well as an anomaly detection algorithm and successfully clusters unknown class samples even in the unlabeled target dataset.

Research Organization:
Argonne National Laboratory (ANL), Lemont, IL (United States); Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
AC02-05CH11231; AC02-06CH11357; AC02-07CH11359
OSTI ID:
1971411
Report Number(s):
FERMILAB-PUB--23-034-CSAID; arXiv:2302.02005
Journal Information:
Machine Learning: Science and Technology, Journal Name: Machine Learning: Science and Technology Journal Issue: 2 Vol. 4; ISSN 2632-2153
Publisher:
IOP PublishingCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (62)

Deep CORAL: Correlation Alignment for Deep Domain Adaptation book January 2016
GalSim: The modular galaxy image simulation toolkit journal April 2015
The illustris simulation: Public data release journal November 2015
Robust unsupervised domain adaptation for neural networks via moment alignment journal May 2019
Deep visual domain adaptation: A survey journal October 2018
Adaptive open domain recognition by coarse-to-fine prototype-based network journal August 2022
The dependence of star formation history and internal structure on stellar mass for 10 5 low-redshift galaxies journal May 2003
Dust properties and star formation of approximately a thousand local galaxies journal October 2019
SExtractor: Software for source extraction journal June 1996
Extragalactic nebulae. journal October 1926
A Preliminary Liminosity Classification for Galaxies of Type Sb. journal March 1960
The Sloan Digital Sky Survey: Technical Summary journal September 2000
A Direct Measurement of Major Galaxy Mergers at z 3 journal September 2003
A New Nonparametric Approach to Galaxy Morphological Classification journal July 2004
ERRATUM: “THE EIGHTH DATA RELEASE OF THE SLOAN DIGITAL SKY SURVEY: FIRST DATA FROM SDSS-III” (2011, ApJS, 193, 29) journal August 2011
A General Approach to Domain Adaptation with Applications in Astronomy journal September 2019
DeepAdversaries: examining the robustness of deep learning models for galaxy morphology classification journal July 2022
The Dark Energy Survey: more than dark energy – an overview journal March 2016
First results from the IllustrisTNG simulations: radio haloes and magnetic fields journal August 2018
Morphological classification of galaxies with deep learning: comparing 3-way and 4-way CNNs journal June 2021
DeepMerge – II. Building robust deep learning algorithms for merging galaxy identification across domains journal June 2021
Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies journal September 2021
Galaxy morphological classification catalogue of the Dark Energy Survey Year 3 data with convolutional neural networks journal July 2021
Galaxy Zoo 2: detailed morphological classifications for 304 122 galaxies from the Sloan Digital Sky Survey journal September 2013
Introducing the Illustris Project: simulating the coevolution of dark and visible matter in the Universe journal August 2014
The EAGLE project: simulating the evolution and assembly of galaxies and their environments journal November 2014
Galaxy morphology and star formation in the Illustris Simulation at z  = 0 journal October 2015
Galaxy Zoo: morphological classifications for 120 000 galaxies in HST legacy imaging journal October 2016
Galaxy Zoo: quantitative visual morphological classifications for 48 000 galaxies from CANDELS journal October 2016
First results from the IllustrisTNG simulations: the stellar mass content of groups and clusters of galaxies journal December 2017
First results from the IllustrisTNG simulations: matter and galaxy clustering journal December 2017
First results from the IllustrisTNG simulations: a tale of two elements – chemical evolution of magnesium and europium journal March 2018
Automated distant galaxy merger classifications from Space Telescope images using the Illustris simulation journal April 2019
Galaxy Zoo: probabilistic morphology through Bayesian CNNs and active learning journal October 2019
simba: Cosmological simulations with black hole growth and feedback journal April 2019
The Hyper Suprime-Cam SSP Survey: Overview and survey design journal September 2017
Deep Residual Learning for Image Recognition conference June 2016
Deep Hashing Network for Unsupervised Domain Adaptation conference July 2017
Importance Weighted Adversarial Nets for Partial Domain Adaptation conference June 2018
Universal Domain Adaptation conference June 2019
Separate to Adapt: Open Set Domain Adaptation via Progressive Separation conference June 2019
Learning to Transfer Examples for Partial Domain Adaptation conference June 2019
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation conference June 2021
Contrastive Domain Adaptation conference June 2021
A Study and Comparison of Human and Deep Learning Recognition Performance under Visual Distortions conference July 2017
Open Set Domain Adaptation conference October 2017
Partial Video Domain Adaptation with Partial Adversarial Temporal Attentive Network conference October 2021
Improved Open Set Domain Adaptation with Backpropagation conference September 2019
Taking the Human Out of the Loop: A Review of Bayesian Optimization journal January 2016
Understanding how image quality affects deep neural networks conference June 2016
Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey journal September 2008
E pur si muove: Galilean-invariant cosmological hydrodynamical simulations on a moving mesh journal January 2010
Galaxy Zoo 1: data release of morphological classifications for nearly 900 000 galaxies★: Galaxy Zoo journal November 2010
A Global Geometric Framework for Nonlinear Dimensionality Reduction journal December 2000
A Survey of Unsupervised Deep Domain Adaptation journal September 2020
The IllustrisTNG simulations: public data release journal May 2019
On Information and Sufficiency journal March 1951
How to Use t-SNE Effectively journal October 2016
Overview of the DESI Legacy Imaging Surveys journal April 2019
The Frequency of Dust Lanes in Edge-on Spiral Galaxies Identified by Galaxy Zoo in KiDS Imaging of GAMA Targets journal August 2019
Galaxy Zoo: Morphological Classification of Galaxy Images from the Illustris Simulation journal February 2018
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019