OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: CLaSPS: A NEW METHODOLOGY FOR KNOWLEDGE EXTRACTION FROM COMPLEX ASTRONOMICAL DATA SETS

Journal Article · Astrophysical Journal
; ;  [1]; ; ;  [2]
  1. Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, MA 02138 (United States)
  2. Department of Astronomy, California Institute of Technology, MC 249-17 1200 East California Blvd, Pasadena, CA 91125 (United States)

In this paper, we present the Clustering-Labels-Score Patterns Spotter (CLaSPS), a new methodology for determining correlations among astronomical observables in complex data sets, based on the application of distinct unsupervised clustering techniques. The novelty of CLaSPS is the criterion used to select the optimal clusterings: a quantitative measure of the degree of correlation between the cluster memberships and the distribution of a set of observables, the labels, that are not employed for the clustering. CLaSPS has been developed primarily as a tool to tackle the complexity of the massive multi-wavelength astronomical data sets produced by federating data from modern automated astronomical facilities. We discuss the application of CLaSPS to two simple astronomical data sets, both composed of extragalactic sources with photometric observations at different wavelengths from large-area surveys. The first data set, CSC+, is composed of optical quasars spectroscopically selected in the Sloan Digital Sky Survey data, observed in X-rays by Chandra, and with multi-wavelength observations in the near-infrared, optical, and ultraviolet spectral intervals. One of the results of applying CLaSPS to CSC+ is the re-identification of a well-known correlation between the $\alpha_{\mathrm{OX}}$ parameter and the near-ultraviolet color, in a subset of CSC+ sources with relatively small values of the near-ultraviolet colors. The other data set consists of a sample of blazars for which photometric observations in the optical, mid-infrared, and near-infrared are available, complemented, for a subset of the sources, by Fermi $\gamma$-ray data.
The main results of the application of CLaSPS to such data sets are the discovery of a strong correlation between the multi-wavelength color distribution of blazars and their optical spectral classification into BL Lac objects and flat-spectrum radio quasars, and of a peculiar pattern followed by blazars in the WISE mid-infrared color space. This pattern and its physical interpretation have been discussed in detail in other papers by one of the authors.
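
The selection criterion described in the abstract (cluster on one set of observables, then rank the clusterings by how strongly cluster membership tracks a held-out label) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the data are synthetic, plain k-means with different numbers of clusters stands in for the "distinct unsupervised clustering techniques", and Cramér's V stands in for the CLaSPS score, whose actual definition is given in the article itself. The function names `kmeans`, `cramers_v`, `make_toy_data`, and `score_clusterings` are hypothetical.

```python
import math
import random
from collections import Counter


def kmeans(points, k, iters=50):
    """Lloyd's k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        # Next centre: the point farthest from all centres chosen so far.
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    assign = None
    for _ in range(iters):
        new_assign = [min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points]
        if new_assign == assign:
            break  # assignments stopped changing
        assign = new_assign
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                centers[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return assign


def cramers_v(clusters, labels):
    """Association between cluster index and label class (0 = none, 1 = perfect).

    Stand-in for the CLaSPS score; NOT the score defined in the paper.
    """
    n = len(clusters)
    rows, cols = Counter(clusters), Counter(labels)
    joint = Counter(zip(clusters, labels))
    chi2 = 0.0
    for r, n_r in rows.items():
        for c, n_c in cols.items():
            expected = n_r * n_c / n
            chi2 += (joint[(r, c)] - expected) ** 2 / expected
    dof = min(len(rows), len(cols)) - 1
    return math.sqrt(chi2 / (n * dof)) if dof > 0 else 0.0


def make_toy_data(n_per_blob=50, seed=1):
    """Synthetic 2-D 'colors' in two separated blobs, with a held-out class label."""
    rng = random.Random(seed)
    points, labels = [], []
    for center, label in (((0.0, 0.0), "A"), ((10.0, 10.0), "B")):
        for _ in range(n_per_blob):
            points.append((center[0] + rng.gauss(0, 0.5), center[1] + rng.gauss(0, 0.5)))
            labels.append(label)
    return points, labels


def score_clusterings(points, labels, ks=(2, 3, 4)):
    """Cluster on the features only, then score each clustering against the labels."""
    return {k: cramers_v(kmeans(points, k), labels) for k in ks}


points, labels = make_toy_data()
scores = score_clusterings(points, labels)
for k, s in scores.items():
    print(f"k={k}: label association = {s:.3f}")
```

The label never enters the clustering step; it is used only afterwards to rank the clusterings, which is the idea the abstract attributes to CLaSPS. In this toy setup, splitting one blob further still leaves every cluster pure in the label, so several values of k can score equally well, and a real application would need the paper's own score and a tie-breaking rule.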

OSTI ID:
22039105
Journal Information:
Astrophysical Journal, Vol. 755, Issue 2; Other Information: Country of input: International Atomic Energy Agency (IAEA); ISSN 0004-637X
Country of Publication:
United States
Language:
English

Similar Records

THE NEXT GENERATION ATLAS OF QUASAR SPECTRAL ENERGY DISTRIBUTIONS FROM RADIO TO X-RAYS
Journal Article · Thu Sep 01 00:00:00 EDT 2011 · Astrophysical Journal, Supplement Series · OSTI ID:22039105

The XMM-BCS galaxy cluster survey: I. The X-ray selected cluster catalog from the initial 6 deg$^2$
Journal Article · Tue Nov 01 00:00:00 EDT 2011 · Submitted to Astron.Astrophys. · OSTI ID:22039105

EIGHT-DIMENSIONAL MID-INFRARED/OPTICAL BAYESIAN QUASAR SELECTION
Journal Article · Wed Apr 15 00:00:00 EDT 2009 · Astronomical Journal (New York, N.Y. Online) · OSTI ID:22039105