Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Practical galaxy morphology tools from deep supervised representation learning

Journal Article · · Monthly Notices of the Royal Astronomical Society
 [1];  [2];  [3];  [4];  [5];  [3];  [6];  [7];  [8];  [9];  [7];  [10]
  1. University of Manchester (United Kingdom)
  2. University of Manchester (United Kingdom); Alan Turing Institute, London (United Kingdom)
  3. University of Oxford (United Kingdom)
  4. University of the Western Cape, Cape Town (South Africa); South African Radio Astronomy Observatory (SARAO), Cape Town (South Africa)
  5. University of the Western Cape, Cape Town (South Africa)
  6. The Open University, Kents Hill (United Kingdom)
  7. University of Minnesota, Minneapolis, MN (United States)
  8. Max-Planck-Institut fur extraterrestrische Physik, Garching bei München, (Germany); European Space Agency, Noordwijk (Netherlands)
  9. Haverford College, PA (United States)
  10. Lancaster University, Bailrigg (United Kingdom)
Astronomers have typically set out to solve supervised machine learning problems by creating their own representations from scratch. We show that deep learning models trained to answer every Galaxy Zoo DECaLS question learn meaningful semantic representations of galaxies that are useful for new tasks on which the models were never trained. We exploit these representations to outperform several recent approaches at practical tasks crucial for investigating large galaxy samples. The first task is identifying galaxies of similar morphology to a query galaxy. Given a single galaxy assigned a free text tag by humans (e.g. ‘#diffuse’), we can find galaxies matching that tag for most tags. The second task is identifying the most interesting anomalies to a particular researcher. Our approach is 100 per cent accurate at identifying the most interesting 100 anomalies (as judged by Galaxy Zoo 2 volunteers). The third task is adapting a model to solve a new task using only a small number of newly labelled galaxies. Models fine-tuned from our representation are better able to identify ring galaxies than models fine-tuned from terrestrial images (ImageNet) or trained from scratch. We solve each task with very few new labels; either one (for the similarity search) or several hundred (for anomaly detection or fine-tuning). This challenges the longstanding view that deep supervised methods require new large labelled data sets for practical use in astronomy. To help the community benefit from our pretrained models, we release our fine-tuning code zoobot. Zoobot is accessible to researchers with no prior experience in deep learning.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC); University of Minnesota, Minneapolis, MN (United States)
Sponsoring Organization:
Alan Turing Institute; Alfred P. Sloan Foundation; National Research Foundation (NRF); National Science Foundation (NSF); South African Radio Astronomy Observatory; USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1982680
Journal Information:
Monthly Notices of the Royal Astronomical Society, Journal Name: Monthly Notices of the Royal Astronomical Society Journal Issue: 2 Vol. 513; ISSN 0035-8711
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United States
Language:
English

References (69)

Bayesian approach to global optimization and application to multiobjective and constrained problems journal July 1991
A strongly nonlinear reaction diffusion model for a deterministic diffusive epidemic journal February 1995
Incremental Learning for Robust Visual Tracking journal August 2007
ImageNet Large Scale Visual Recognition Challenge journal April 2015
Machine and Deep Learning applied to galaxy morphology - A comparative study journal January 2020
Suitability of a content-based retrieval method in astronomical image databases journal January 1996
Galaxy morphology — An unsupervised machine learning approach journal September 2015
Effectively using unsupervised machine learning in next generation astronomical surveys journal January 2021
Astronomaly: Personalised active anomaly detection in astronomical data journal July 2021
Domain-specific classification-pretrained fully convolutional network encoders for skin lesion segmentation journal January 2019
Efficient Global Optimization of Expensive Black-Box Functions journal January 1998
Random Forests journal January 2001
Deep learning journal May 2015
Automatic Detection of Galaxy Type From Datasets of Galaxies Image Based on Image Retrieval Approach journal June 2017
ARRAKIS: atlas of resonance rings as known in the S 4 G journal February 2014
Morphology-assisted galaxy mass-to-light predictions using deep learning journal April 2019
Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra journal July 2020
The response of gas in a galactic disk to bar forcing journal July 1981
Probing the Evolution of the Galaxy Interaction/Merger Rate Using Collisional Ring Galaxies journal September 2004
A Catalog of Detailed Visual Morphological Classifications for 14,034 Galaxies in the Sloan Digital sky Survey journal February 2010
KOI-54: THE KEPLER DISCOVERY OF TIDALLY EXCITED PULSATIONS AND BRIGHTENINGS IN A HIGHLY ECCENTRIC BINARY journal September 2011
A CLASSICAL MORPHOLOGICAL ANALYSIS OF GALAXIES IN THE SPITZER SURVEY OF STELLAR STRUCTURE IN GALAXIES (S 4 G) journal April 2015
Multiband Galaxy Morphologies for CLASH: A Convolutional Neural Network Transferred from CANDELS journal August 2019
Radio Galaxy Zoo: Unsupervised Clustering of Convolutionally Auto-encoded Radio-astronomical Images journal September 2019
The weirdest SDSS galaxies: results from an outlier detection algorithm journal November 2016
Galactic rings revisited – I. CVRHS classifications of 3962 ringed galaxies from the Galaxy Zoo 2 Database journal July 2017
Improving galaxy morphologies for SDSS with Deep Learning journal February 2018
AstroVaDEr: astronomical variational deep embedder for unsupervised morphological classification of galaxies and synthetic image generation journal November 2020
A deep learning approach to test the small-scale galaxy morphology and its relationship with star formation activity in hydrodynamical simulations journal December 2020
Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies journal September 2021
Anomaly detection in Hyper Suprime-Cam galaxy images with generative adversarial networks journal September 2021
Survey2Survey: a deep learning generative model approach for cross-survey image mapping journal February 2021
Beyond the hubble sequence – exploring galaxy morphology with unsupervised machine learning journal March 2021
Galaxy Zoo: bulgeless galaxies with growing black holes journal January 2013
Galaxy Zoo 2: detailed morphological classifications for 304 122 galaxies from the Sloan Digital Sky Survey journal September 2013
Rotation-invariant convolutional neural networks for galaxy morphology prediction journal April 2015
Galaxy Zoo: comparing the demographics of spiral arm number and a new method for correcting redshift bias journal July 2016
Planet Hunters IX. KIC 8462852 – where's the flux? journal January 2016
An automatic taxonomy of galaxy morphology using unsupervised machine learning journal September 2017
Galaxy Zoo: the interplay of quenching mechanisms in the group environment★ journal April 2017
Using transfer learning to detect galaxy mergers journal May 2018
Radio Galaxy Zoo: Claran – a deep learning classifier for radio morphologies journal October 2018
SDSS-IV MaNGA PyMorph Photometric and Deep Learning Morphological Catalogues and implications for bulge properties and stellar angular momentum journal November 2018
Transfer learning for galaxy morphology from one survey to another journal December 2018
A comprehensive examination of the optical morphologies of 719 isolated galaxies in the AMIGA sample journal June 2019
Galaxy Zoo: probabilistic morphology through Bayesian CNNs and active learning journal October 2019
Automatic detection of full ring galaxy candidates in SDSS journal November 2019
Self-supervised Learning for Astronomical Image Classification conference January 2021
Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks journal January 2016
Matplotlib: A 2D Graphics Environment journal January 2007
Rings and spirals in barred galaxies - I. Building blocks journal March 2009
Galaxy Zoo: ‘Hanny's Voorwerp’, a quasar light echo? journal October 2009
Galaxy Zoo Green Peas: discovery of a class of compact extremely star-forming galaxies journal November 2009
Applying the analytic theory of colliding ring galaxies journal April 2010
A new catalogue of polar-ring galaxies selected from the Sloan Digital Sky Survey★ journal September 2011
Feedback-Guided Anomaly Discovery via Online Optimization conference July 2018
Towards fairer datasets conference January 2020
Deep Learning for Morphological Classification of Galaxies from sdss journal October 2019
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising conference January 2020
UMAP: Uniform Manifold Approximation and Projection journal September 2018
A rules-based and Transfer Learning approach for deriving the Hubble type of a galaxy from the Galaxy Zoo data conference July 2020
The Astropy Project: Building an Open-science Project and Status of the v2.0 Core Package journal August 2018
Overview of the DESI Legacy Imaging Surveys journal April 2019
Galaxy Morphology Network: A Convolutional Neural Network Used to Study Morphology and Quenching in ∼100,000 SDSS and ∼20,000 CANDELS Galaxies journal June 2020
Capturing the Physics of MaNGA Galaxies with Self-supervised Machine Learning journal November 2021
A Catalog of Automatically Detected Ring Galaxy Candidates in PanSTARRS journal July 2017
The 13th Data Release of the Sloan Digital Sky Survey: First Spectroscopic Data from the SDSS-IV Survey Mapping Nearby Galaxies at Apache Point Observatory journal December 2017
Self-supervised Representation Learning for Astronomical Images journal April 2021
scikit-image: image processing in Python journal January 2014

Cited By (1)

Learning useful representations for radio astronomy "in the wild" with contrastive learning preprint January 2022

Similar Records

Self-supervised Representation Learning for Astronomical Images
Journal Article · Sun Apr 25 20:00:00 EDT 2021 · The Astrophysical Journal. Letters · OSTI ID:1813371

RINO: Renormalization Group Invariance with No Labels
Conference · Tue Sep 09 00:00:00 EDT 2025 · No journal information · OSTI ID:3003666

Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
Journal Article · Tue Jul 16 20:00:00 EDT 2024 · Machine Learning: Science and Technology · OSTI ID:2403614