skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: The LSST AGN Data Challenge: Selection Methods

Journal Article · · The Astrophysical Journal
ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo

Abstract Development of the Rubin Observatory Legacy Survey of Space and Time (LSST) includes a series of Data Challenges (DCs) arranged by various LSST Scientific Collaborations that are taking place during the project's preoperational phase. The AGN Science Collaboration Data Challenge (AGNSC-DC) is a partial prototype of the expected LSST data on active galactic nuclei (AGNs), aimed at validating machine learning approaches for AGN selection and characterization in large surveys like LSST. The AGNSC-DC took place in 2021, focusing on accuracy, robustness, and scalability. The training and the blinded data sets were constructed to mimic the future LSST release catalogs using the data from the Sloan Digital Sky Survey Stripe 82 region and the XMM-Newton Large Scale Structure Survey region. Data features were divided into astrometry, photometry, color, morphology, redshift, and class label with the addition of variability features and images. We present the results of four submitted solutions to DCs using both classical and machine learning methods. We systematically test the performance of supervised models (support vector machine, random forest, extreme gradient boosting, artificial neural network, convolutional neural network) and unsupervised ones (deep embedding clustering) when applied to the problem of classifying/clustering sources as stars, galaxies, or AGNs. We obtained classification accuracy of 97.5% for supervised models and clustering accuracy of 96.0% for unsupervised ones and 95.0% with a classic approach for a blinded data set. We find that variability features significantly improve the accuracy of the trained models, and correlation analysis among different bands enables a fast and inexpensive first-order selection of quasar candidates.

Research Organization:
Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
AC02-07CH11359
OSTI ID:
1995039
Alternate ID(s):
OSTI ID: 1989912
Report Number(s):
FERMILAB-PUB-22-735-SCD; arXiv:2307.04072
Journal Information:
The Astrophysical Journal, Journal Name: The Astrophysical Journal Vol. 953 Journal Issue: 2; ISSN 0004-637X
Publisher:
American Astronomical SocietyCopyright Statement
Country of Publication:
United States
Language:
English

References (108)

Black Hole Mass Estimates from Reverberation Mapping and from Spatially Resolved Kinematics journal November 2000
A random forest-based selection of optically variable AGN in the VST-COSMOS field journal January 2021
The Sloan Digital Sky Survey Photometric Camera journal December 1998
Deep learning journal May 2015
The Sdss-Iv Extended Baryon Oscillation Spectroscopic Survey: Quasar Target Selection journal December 2015
THE CHANDRA DEEP FIELD-SOUTH SURVEY: 7 MS SOURCE CATALOGS journal December 2016
The Fraction of Active Galactic Nuclei in the USS 1558–003 Protocluster at z = 2.53 journal March 2019
Second Data Release of the All-sky NOIRLab Source Catalog journal March 2021
Extending the variability selection of active galactic nuclei in the W-CDF-S and SERVS/SWIRE region journal February 2020
Subaru High- z Exploration of Low-luminosity Quasars (SHELLQs). IV. Discovery of 41 Quasars and Luminous Galaxies at 5.7 ≤ z ≤ 6.9 journal July 2018
Cross-Validatory Choice and Assessment of Statistical Predictions journal January 1974
A wide-field multicolor survey for high-redshift quasars, Z above 2.2. I - Photometric catalog and survey selection function journal May 1991
Active galactic nuclei catalog from the AKARI NEP-Wide field journal July 2021
On Machine-Learned Classification of Variable Stars with Sparse and Noisy Time-Series data journal April 2011
Exploring Reionization-era Quasars. III. Discovery of 16 Quasars at 6.4 ≲ z ≲ 6.9 with DESI Legacy Imaging Surveys and the UKIRT Hemisphere Survey and Quasar Luminosity Function at z ∼ 6.7 journal October 2019
Extracting information from AGN variability journal June 2017
The 16th Data Release of the Sloan Digital Sky Surveys: First Release from the APOGEE-2 Southern Survey and Full Release of eBOSS Spectra journal June 2020
The XMM-SERVS Survey: XMM-Newton Point-source Catalogs for the W-CDF-S and ELAIS-S1 Fields journal September 2021
Quasi-Stellar Object Selection Algorithm Using time Variability and Machine Learning: Selection of 1620 Quasi-Stellar Object Candidates from Macho Large Magellanic Cloud Database journal June 2011
An Application of Multi-band Forced Photometry to One Square Degree of SERVS: Accurate Photometric Redshifts and Implications for Future Science journal May 2017
Modelling type 1 quasar colours in the era of Rubin and Euclid journal September 2021
Eight new luminous z ≥ 6 quasars discovered via SED model fitting of VISTA, WISE and Dark Energy Survey Year 1 observations journal March 2017
Differential Chromatic Refraction in the Context of the Legacy Survey of Space and Time journal December 2020
Measuring X-Ray Variability in Faint/Sparsely Sampled Active Galactic Nuclei journal June 2013
Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra journal July 2020
Stars with zero proper motion and the number of faint QSOs journal August 1981
Quasar Classification Using Color and Variability journal September 2015
The UKIRT Infrared Deep Sky Survey (UKIDSS) journal August 2007
ulisse: A tool for one-shot sky exploration and its application for detection of active galactic nuclei journal October 2022
The Dark Energy Survey: more than dark energy – an overview journal March 2016
The Hyper Suprime-Cam SSP Survey: Overview and survey design journal September 2017
SEARCH FOR GAMMA-RAY-EMITTING ACTIVE GALACTIC NUCLEI IN THE FERMI -LAT UNASSOCIATED SAMPLE USING MACHINE LEARNING journal January 2014
The Sloan Digital Sky Survey: Technical Summary journal September 2000
The UKIRT Hemisphere Survey: definition and J-band data release journal October 2017
MID-INFRARED VARIABILITY FROM THE SPITZER DEEP WIDE-FIELD SURVEY journal May 2010
THE SLOAN DIGITAL SKY SURVEY COADD: 275 deg 2 OF DEEP SLOAN DIGITAL SKY SURVEY IMAGING ON STRIPE 82 journal September 2014
Sloan Digital Sky Survey Standard Star Catalog for Stripe 82: The Dawn of Industrial 1% Optical Photometry journal July 2007
Quasars to B greater than 22.5 in selected area 57 - A catalog of multicolor photometry, variability, and astrometry journal March 1986
Photometric Supernova Classification with Machine Learning journal August 2016
Revisiting the Unified Model of Active Galactic Nuclei journal August 2015
Exploring Reionization-era Quasars. IV. Discovery of Six New z ≳ 6.5 Quasars with DES, VHS, and unWISE Photometry journal May 2019
Blazar Variability with the Vera C. Rubin Legacy Survey of Space and Time journal December 2021
Incorporating Measurement Error in Astronomical Object Classification journal June 2022
The Canada-France-Hawaii Telescope Legacy Survey: Stacked Images and Catalogs journal January 2012
Quantifying Quasar Variability as part of a General Approach to Classifying Continuously Varying Sources journal December 2009
On the Nature of Faint Blue Objects in High Galactic Latitudes. I. Photometry, Proper Motions, and Spectra in PHL Field 1:36+6° and Richter Field M3, II journal June 1967
Gaia Early Data Release 3: Summary of the contents and survey properties journal April 2021
Overview of the DESI Legacy Imaging Surveys journal April 2019
THE CANADA-FRANCE HIGH- z QUASAR SURVEY: NINE NEW QUASARS AND THE LUMINOSITY FUNCTION AT REDSHIFT 6 journal February 2010
Matplotlib: A 2D Graphics Environment journal January 2007
A Survey of z  > 5.7 Quasars in the Sloan Digital Sky Survey. IV. Discovery of Seven Additional Quasars journal March 2006
Astrometric Redshifts for Quasars journal May 2009
The LSST operations simulator conference August 2014
A new algorithm for difference image analysis journal May 2008
The XMM-SERVS survey: new XMM–Newton point-source catalogue for the XMM-LSS field journal April 2018
THE IDENTIFICATION OF z -DROPOUTS IN PAN-STARRS1: THREE QUASARS AT 6.5< z < 6.7 journal February 2015
THE z = 5 QUASAR LUMINOSITY FUNCTION FROM SDSS STRIPE 82 journal April 2013
A SURVEY OF LUMINOUS HIGH-REDSHIFT QUASARS WITH SDSS AND WISE. I. TARGET SELECTION AND OPTICAL SPECTROSCOPY journal February 2016
A survey for faint variable objects in SA 57 journal July 1989
Spectroscopic Target Selection in the Sloan Digital Sky Survey: The Quasar Sample journal June 2002
Examining AGN UV/Optical Variability beyond the Simple Damped Random Walk journal September 2022
The Hyper Suprime-Cam software pipeline journal October 2017
Least squares quantization in PCM journal March 1982
Cosmic X-ray surveys of distant active galaxies: The demographics, physics, and ecology of growing supermassive black holes journal January 2015
Gaia Data Release 2 : Summary of the contents and survey properties journal August 2018
Optimal Time-Series Selection of Quasars journal February 2011
Filling in the Quasar Redshift Gap at z ∼ 5.5. II. A Complete Survey of Luminous Quasars in the Post-reionization Universe journal January 2019
Is Quasar Optical Variability a Damped Random Walk? journal February 2013
Active galactic nuclei: what’s in a name? journal August 2017
A Fundamental Relation between Supermassive Black Holes and Their Host Galaxies journal August 2000
Galaxy Zoo: reproducing galaxy morphologies via machine learning★: Galaxy Zoo: morphology via machine learning journal April 2010
Approximation by superpositions of a sigmoidal function journal December 1989
Modelling photometric reverberation mapping data for the next generation of big data surveys. Quasar accretion discs sizes with the LSST journal February 2023
Optimization of the Observing Cadence for the Rubin Observatory Legacy Survey of Space and Time: A Pioneering Process of Community-focused Experimental Design journal December 2021
Probabilistic Cross‐Identification of Astronomical Sources journal May 2008
Unified Models for Active Galactic Nuclei and Quasars journal September 1993
Are the Variations in Quasar Optical flux Driven by Thermal Fluctuations? journal May 2009
The WFCAM Science Archive: The WFCAM Science Archive journal January 2008
The VISTA Deep Extragalactic Observations (VIDEO) survey★ journal October 2012
The eROSITA Final Equatorial-Depth Survey (eFEDS) journal May 2022
Coevolution (Or Not) of Supermassive Black Holes and Host Galaxies journal August 2013
Use of neural networks for the identification of newz≥ 3.6 QSOs from FIRST-SDSS DR5 journal November 2008
Current and Future Applications of Reverberation-Mapped Quasars in Cosmology journal December 2019
Cosmological constraints from the Hubble diagram of quasars at high redshifts journal January 2019
Measuring the broad-band power spectra of active galactic nuclei with RXTE journal May 2002
THE PAN-STARRS1 DISTANT z > 5.6 QUASAR SURVEY: MORE THAN 100 QUASARS WITHIN THE FIRST GYR OF THE UNIVERSE journal November 2016
Stochastic Modeling Handbook for Optical AGN Variability journal March 2019
Improving Damped Random Walk Parameters for SDSS Stripe 82 Quasars with Pan-STARRS1 journal February 2021
The LSST DESC data challenge 1: generation and analysis of synthetic images for next-generation surveys journal July 2020
The XMM-Large Scale Structure catalogue: X-ray sources and associated optical data. Version I journal November 2007
Optically variable AGN in the three-year VST survey of the COSMOS field journal June 2019
HELP: the Herschel Extragalactic Legacy Project journal June 2021
Photometric Characterization of the Dark Energy Camera journal April 2018
Photometric classification of emission line galaxies with machine-learning methods journal November 2013
The Gaia mission journal November 2016
Selecting Quasars by Their Intrinsic Variability journal April 2010
X-ray/UV/optical variability of NGC 4593 with Swift: reprocessing of X-rays by an extended reprocessor journal August 2018
The UKIRT wide field camera ZYJHK photometric system: calibration from 2MASS journal April 2009
LSST: From Science Drivers to Reference Design and Anticipated Data Products journal March 2019
Physical Properties of 15 Quasars at z ≳ 6.5 journal November 2017
The UKIRT Infrared Deep Sky Survey ZY JHK photometric system: passbands and synthetic colours journal April 2006
Identifying AGN Host Galaxies by Machine Learning with HSC+WISE journal October 2021
THE FINAL SDSS HIGH-REDSHIFT QUASAR SAMPLE OF 52 QUASARS AT z > 5.7 journal December 2016
Observational Evidence of Active Galactic Nuclei Feedback journal September 2012
Quasar Variability in the Mid-Infrared journal January 2016
The Wide-Field Infrared Survey Explorer (Wise): Mission Description and Initial On-Orbit Performance journal November 2010
An active galactic nucleus recognition model based on deep neural network journal December 2020
Accretion of Interstellar Matter by Massive Objects. journal August 1964