DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Towards an astronomical foundation model for stars with a transformer-based model

Journal Article · · Monthly Notices of the Royal Astronomical Society

ABSTRACT Rapid strides are currently being made in the field of artificial intelligence using transformer-based models like Large Language Models (LLMs). The potential of these methods for creating a single, large, versatile model in astronomy has not yet been explored. In this work, we propose a framework for data-driven astronomy that uses the same core techniques and architecture as used by LLMs. Using a variety of observations and labels of stars as an example, we build a transformer-based model and train it in a self-supervised manner with cross-survey data sets to perform a variety of inference tasks. In particular, we demonstrate that a single model can perform both discriminative and generative tasks even if the model was not trained or fine-tuned to do any specific task. For example, on the discriminative task of deriving stellar parameters from Gaia XP spectra, we achieve an accuracy of 47 K in Teff, 0.11 dex in log g, and 0.07 dex in [M/H], outperforming an expert XGBoost model in the same setting. But the same model can also generate XP spectra from stellar parameters, inpaint unobserved spectral regions, extract empirical stellar loci, and even determine the interstellar extinction curve. Our framework demonstrates that building and training a single foundation model without fine-tuning using data and parameters from multiple surveys to predict unmeasured observations and parameters is well within reach. Such ‘Large Astronomy Models’ trained on large quantities of observational data will play a large role in the analysis of current and future large surveys.

Sponsoring Organization:
USDOE
OSTI ID:
2205369
Journal Information:
Monthly Notices of the Royal Astronomical Society, Journal Name: Monthly Notices of the Royal Astronomical Society Journal Issue: 1 Vol. 527; ISSN 0035-8711
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (74)

LoRA: Low-Rank Adaptation of Large Language Models preprint January 2021
Learning a Similarity Metric Discriminatively, with Application to Face Verification conference January 2005
Gaia Early Data Release 3: Photometric content and validation journal April 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale preprint January 2020
Self-supervised Representation Learning for Astronomical Images journal April 2021
Improving Gaia Parallax Precision with a Data-driven Model of Stars journal September 2018
Deep Attention-based Supernovae Classification of Multiband Light Curves journal December 2022
A 3D Dust Map Based on Gaia , Pan-STARRS 1, and 2MASS journal December 2019
Parameters of 220 million stars from Gaia BP/RP spectra journal June 2023
Simultaneous calibration of spectro-photometric distances and the Gaia DR2 parallax zero-point offset with deep learning journal August 2019
Efficient Estimation of Word Representations in Vector Space preprint January 2013
Mining for Strong Gravitational Lenses with Self-supervised Learning journal June 2022
Gaia Data Release 3 journal June 2023
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding preprint January 2018
Astronomia ex machina: a history, primer and outlook on neural networks in astronomy journal May 2023
The COBE Diffuse Infrared Background Experiment Search for the Cosmic Infrared Background. I. Limits and Detections journal November 1998
SGDR: Stochastic Gradient Descent with Warm Restarts preprint January 2016
Sparks of Artificial General Intelligence: Early experiments with GPT-4 preprint January 2023
Decoupled Weight Decay Regularization preprint January 2017
Radio Galaxy Zoo: Towards building the first multi-purpose foundation model for radio astronomy with self-supervised learning preprint January 2023
Toward a Spectral Foundation Model: An Attention-Based Approach with Domain-Inspired Fine-Tuning and Wavelength Parameterization preprint January 2023
Deep learning of multi-element abundances from high-resolution spectroscopic data journal November 2018
The Gaia mission journal November 2016
The Poor Old Heart of the Milky Way journal December 2022
Gaia Data Release 3. External calibration of BP/RP low-resolution spectroscopic data journal June 2022
Paying Attention to Astronomical Transients: Introducing the Time-series Transformer for Photometric Classification preprint January 2021
GPT-4 Technical Report preprint March 2024
Measuring Reddening with Sloan Digital sky Survey Stellar Spectra and Recalibrating sfd journal August 2011
Euclid Definition Study Report preprint January 2011
The Two Micron All Sky Survey (2MASS) journal February 2006
Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies, and the Distant Universe journal June 2017
A three-dimensional Galactic extinction model journal October 2003
Deep contextualized word representations preprint January 2018
The Apache Point Observatory Galactic Evolution Experiment (APOGEE) Spectrographs journal March 2019
Gaussian Error Linear Units (GELUs) preprint January 2016
Closing the stellar labels gap: An unsupervised, generative model for $\textit{Gaia}$ BP/RP spectra text January 2023
Astromer journal February 2023
Dive into Deep Learning preprint January 2021
Adam: A Method for Stochastic Optimization preprint January 2014
On Galactic Density Modeling in the Presence of dust Extinction journal February 2016
Some experiments in the generation of word and document associations conference January 1962
Attention Is All You Need preprint January 2017
Emergent Abilities of Large Language Models preprint January 2022
Dimensionality Reduction by Learning an Invariant Mapping conference January 2006
Hunting for C-rich long-period variable stars in the Milky Way’s bar-bulge using unsupervised classification ofGaiaBP/RP spectra journal March 2023
Celestial Spectra Classification Network Based on Residual and Attention Mechanisms journal March 2020
Layer Normalization preprint January 2016
Deep Residual Learning for Image Recognition preprint January 2015
extinction v0.3.0 software December 2016
PyTorch: An Imperative Style, High-Performance Deep Learning Library preprint January 2019
A Data Science Platform to Enable Time-domain Astronomy journal July 2023
Momentum Contrast for Unsupervised Visual Representation Learning preprint January 2019
Supervised Contrastive Learning preprint January 2020
Gaia Data Release 3: Processing and validation of BP/RP low-resolution spectral data journal July 2022
The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar, and APOGEE-2 Data journal March 2022
Aspcap: the Apogee Stellar Parameter and Chemical Abundances Pipeline journal May 2016
Neural Machine Translation by Jointly Learning to Align and Translate preprint January 2014
Learning Transferable Visual Models From Natural Language Supervision preprint January 2021
Gaia Early Data Release 3: Parallax bias versus magnitude, colour, and position journal April 2021
Towards Galaxy Foundation Models with Hybrid Contrastive Learning preprint January 2022
Galactic ChitChat: Using Large Language Models to Converse with Astronomy Literature journal September 2023
Modelling the Galactic interstellar extinction distribution in three dimensions journal June 2006
On Faithfulness and Factuality in Abstractive Summarization preprint January 2020
Correcting for the Effects of Interstellar Extinction journal January 1999
Robust Data-driven Metallicities for 175 Million Stars from Gaia XP Spectra journal July 2023
SDSS-V: Pioneering Panoptic Spectroscopy preprint January 2017
Astroformer: More Data Might not be all you need for Classification preprint January 2023
A variational encoder–decoder approach to precise spectroscopic age estimation for large Galactic surveys journal April 2023
Internal calibration of Gaia BP/RP low-resolution spectra journal August 2021
Mixed Precision Training preprint January 2017
The Apache Point Observatory Galactic Evolution Experiment (APOGEE) journal August 2017
A measurement of the distance to the Galactic centre using the kinematics of bar stars journal December 2022
Maps of Dust Infrared Emission for Use in Estimation of Reddening and Cosmic Microwave Background Radiation Foregrounds journal June 1998
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019

Similar Records

Related Subjects