Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Quasar Identification Using Multivariate Probability Density Estimated from Nonparametric Conditional Probabilities

Journal Article · · Mathematics
DOI:https://doi.org/10.3390/math11010155· OSTI ID:2425264

Nonparametric estimation for a probability density function that describes multivariate data has typically been addressed by kernel density estimation (KDE). A novel density estimator recently developed by Farmer and Jacobs offers an alternative high-throughput automated approach to univariate nonparametric density estimation based on maximum entropy and order statistics, improving accuracy over univariate KDE. This article presents an extension of the single variable case to multiple variables. The univariate estimator is used to recursively calculate a product array of one-dimensional conditional probabilities. In combination with interpolation methods, a complete joint probability density estimate is generated for multiple variables. Good accuracy and speed performance in synthetic data are demonstrated by a numerical study using known distributions over a range of sample sizes from 100 to 106 for two to six variables. Performance in terms of speed and accuracy is compared to KDE. The multivariate density estimate developed here tends to perform better as the number of samples and/or variables increases. As an example application, measurements are analyzed over five filters of photometric data from the Sloan Digital Sky Survey Data Release 17. The multivariate estimation is used to form the basis for a binary classifier that distinguishes quasars from galaxies and stars with up to 94% accuracy.

Research Organization:
US Department of Energy (USDOE), Washington, DC (United States). Office of Science, Sloan Digital Sky Survey (SDSS)
Sponsoring Organization:
USDOE Office of Science (SC)
OSTI ID:
2425264
Journal Information:
Mathematics, Journal Name: Mathematics Journal Issue: 1 Vol. 11; ISSN 2227-7390
Publisher:
MDPICopyright Statement
Country of Publication:
United States
Language:
English

References (49)

Index for rating diagnostic tests journal January 1950
Multivariate Density Estimation: Theory, Practice, and Visualization book March 2015
Nonparametric density estimation for high‐dimensional data—Algorithms and applications journal April 2019
Data-driven deep density estimation journal July 2021
Nonparametric Estimation of Multivariate Density and its Derivative by Dependent Data Using Gamma Kernels journal March 2021
Fast Algorithm for Choosing Kernel Function Blur Coefficients in a Nonparametric Probability Density Estimate journal September 2018
Nonparametric multivariate density estimation using mixtures journal November 2013
A new maximum entropy method for estimation of multimodal probability density function journal February 2022
An introduction to the maximum entropy approach and its application to inference problems in biology journal April 2018
Classification method based on multidimensional probability density function estimation dedicated to embedded systems journal January 2018
An improved algorithm for the multidimensional moment-constrained maximum entropy problem journal September 2007
Nonparametric density estimation for multivariate bounded data journal January 2010
MATLAB tool for probability density assessment and nonparametric estimation journal June 2022
Structural reliability analysis based on the concepts of entropy, fractional moment and dimensional reduction method journal July 2013
A new maximum entropy-based importance sampling for reliability analysis journal November 2016
Probability Density Function Estimation Using Gamma Kernels journal September 2000
Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra journal July 2020
Nonparametric monitoring of multivariate data via KNN learning journal September 2020
Bernstein polynomial model for nonparametric multivariate density journal February 2019
Adaptive Bayesian bandwidth selection in asymmetric kernel density estimation for nonnegative heavy-tailed data journal February 2015
Incorporating support constraints into nonparametric estimators of densities journal January 1985
The Sloan Digital Sky Survey Photometric System journal April 1996
The Sloan Digital Sky Survey Photometric Camera journal December 1998
Sloan Digital Sky Survey: Early Data Release journal January 2002
The 2.5 m Telescope of the Sloan Digital Sky Survey journal April 2006
Photometric Response Functions of the Sloan Digital sky Survey Imager journal March 2010
Smooth optimum kernel estimators near endpoints journal January 1991
Information Theory and Statistical Mechanics journal May 1957
Nonparametric multivariate density estimation: a comparative study journal January 1994
Application of Machine Learning to Interpret Predictability of Different Models: Approach to Classification for SDSS Sources conference September 2021
An Improved Kernel Density Estimation with adaptive bandwidth selection for Edge detection conference February 2021
A Computationally Efficient Multivariate Maximum-Entropy Density Estimation (MEDE) Technique journal February 2004
Density estimation and random variate generation using multilayer networks journal May 2002
Multivariate Density Estimation by Neural Networks journal February 2024
On Estimation of a Probability Density Function and Mode journal September 1962
Remarks on Some Nonparametric Estimates of a Density Function journal September 1956
High throughput nonparametric probability density estimation journal May 2018
Classification of SDSS Photometric Data Using Machine Learning on A Cloud journal July 2018
PDFEstimator: An R Package for Density Estimation and Analysis journal October 2022
Universal Sample Size Invariant Measures for Uncertainty Quantification in Density Estimation journal November 2019
Adaptive Nonparametric Kernel Density Estimation Approach for Joint Probability Density Function Modeling of Multiple Wind Farms journal April 2019
Nonparametric Estimation of the Density Function of the Distribution of the Noise in CHARN Models journal February 2022
An Improved Model for Kernel Density Estimation Based on Quadtree and Quasi-Interpolation journal July 2022
Asymptotic Convergence of Soft-Constrained Neural Networks for Density Estimation journal April 2020
An Improved Variable Kernel Density Estimator Based on L2 Regularization journal August 2021
Nonparametric Multivariate Density Estimation: Case Study of Cauchy Mixture Model journal October 2021
Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies, and the Distant Universe journal June 2017
The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar, and APOGEE-2 Data journal March 2022
Identifying galaxies, quasars and stars with machine learning: a new catalogue of classifications for 111 million SDSS sources without spectra dataset January 2019

Figures / Tables (14)