Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Self-supervised Representation Learning for Astronomical Images

Journal Article · · The Astrophysical Journal. Letters
 [1];  [2];  [3];  [3];  [3]
  1. Univ. of Arkansas, Fayetteville, AR (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Univ. of California, Berkeley, CA (United States)
  3. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sky surveys are the largest data generators in astronomy, making automated tools for extracting meaningful scientific information an absolute necessity. We show that, without the need for labels, self-supervised learning recovers representations of sky survey images that are semantically useful for a variety of scientific tasks. These representations can be directly used as features, or fine-tuned, to outperform supervised methods trained only on labeled data. We apply a contrastive learning framework on multiband galaxy photometry from the Sloan Digital Sky Survey (SDSS), to learn image representations. We then use them for galaxy morphology classification and fine-tune them for photometric redshift estimation, using labels from the Galaxy Zoo 2 data set and SDSS spectroscopy. In both downstream tasks, using the same learned representations, we outperform the supervised state-of-the-art results, and we show that our approach can achieve the accuracy of supervised models while using 2-4 times fewer labels for training. The codes, trained models, and data can be found at https://portal.nersc.gov/project/dasrepo/self-supervised-learning-sdss.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1813371
Alternate ID(s):
OSTI ID: 23154197
Journal Information:
The Astrophysical Journal. Letters, Journal Name: The Astrophysical Journal. Letters Journal Issue: 2 Vol. 911; ISSN 2041-8205
Publisher:
IOP PublishingCopyright Statement
Country of Publication:
United States
Language:
English

References (36)

georgestein/ml-in-cosmology: Machine learning in cosmology dataset January 2020
Measuring photometric redshifts using galaxy images and Deep Neural Networks journal July 2016
Effectively using unsupervised machine learning in next generation astronomical surveys journal January 2021
Deep learning at scale for the construction of galaxy catalogs in the Dark Energy Survey journal August 2019
The many flavours of photometric redshifts journal June 2018
Photometric redshift estimation via deep learning: Generalized and pre-classification-less, image based, fully probabilistic redshifts journal January 2018
Photometric redshifts from SDSS images using a convolutional neural network journal December 2018
Slicing Through Multicolor Space: Galaxy Redshifts from Broadband Photometry journal December 1995
Photometric redshifts of galaxies journal April 1986
The Sloan Digital Sky Survey Photometric Camera journal December 1998
Drizzle: A Method for the Linear Reconstruction of Undersampled Images journal February 2002
Spectroscopic Target Selection in the Sloan Digital Sky Survey: The Main Galaxy Sample journal September 2002
Preparing Red‐Green‐Blue Images from CCD Data
  • Lupton, Robert; Blanton, Michael R.; Fekete, George
  • Publications of the Astronomical Society of the Pacific, Vol. 116, Issue 816 https://doi.org/10.1086/382245
journal February 2004
The 2.5 m Telescope of the Sloan Digital Sky Survey journal April 2006
Measuring Reddening with Sloan Digital sky Survey Stellar Spectra and Recalibrating sfd journal August 2011
The Eleventh and Twelfth data Releases of the Sloan Digital sky Survey: Final data from Sdss-Iii journal July 2015
Morphology in the era of large surveys journal September 2013
Identifying strong lenses with unsupervised machine learning using convolutional autoencoder journal April 2020
Detecting outliers in astronomical images with deep generative networks journal June 2020
AstroVaDEr: astronomical variational deep embedder for unsupervised morphological classification of galaxies and synthetic image generation journal November 2020
Galaxy Zoo 2: detailed morphological classifications for 304 122 galaxies from the Sloan Digital Sky Survey journal September 2013
Rotation-invariant convolutional neural networks for galaxy morphology prediction journal April 2015
Photometric redshifts for the SDSS Data Release 12 journal April 2016
Galaxy Zoo: comparing the demographics of spiral arm number and a new method for correcting redshift bias journal July 2016
An automatic taxonomy of galaxy morphology using unsupervised machine learning journal September 2017
Improving galaxy morphologies for SDSS with Deep Learning journal February 2018
Transfer learning for galaxy morphology from one survey to another journal December 2018
Galaxy Zoo: probabilistic morphology through Bayesian CNNs and active learning journal October 2019
Galaxy morphological classification in deep-wide surveys via unsupervised machine learning journal October 2019
Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey journal September 2008
Galaxy Zoo: ‘Hanny's Voorwerp’, a quasar light echo? journal October 2009
The Galaxy Zoo survey for giant AGN-ionized clouds: past and present black hole accretion events: Giant AGN clouds journal December 2011
Anomaly Detection for Astronomical Data text January 2010
A Study of the Point-spread Function in SDSS Images journal October 2018
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019
georgestein/ml-in-cosmology: Machine learning in cosmology dataset January 2020

Similar Records

Practical galaxy morphology tools from deep supervised representation learning
Journal Article · Sun Feb 27 19:00:00 EST 2022 · Monthly Notices of the Royal Astronomical Society · OSTI ID:1982680

Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
Journal Article · Tue Jul 16 20:00:00 EDT 2024 · Machine Learning: Science and Technology · OSTI ID:2403614

Optical Wavelength Guided Self-Supervised Feature Learning For Galaxy Cluster Richness Estimate
Conference · Wed Dec 02 23:00:00 EST 2020 · OSTI ID:1887831