skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A fast and objective multidimensional kernel density estimation method: fastKDE

Journal Article · · Computational Statistics and Data Analysis (Print)

Numerous facets of scientific research implicitly or explicitly call for the estimation of probability densities. Histograms and kernel density estimates (KDEs) are two commonly used techniques for estimating such information, with the KDE generally providing a higher fidelity representation of the probability density function (PDF). Both methods require specification of either a bin width or a kernel bandwidth. While techniques exist for choosing the kernel bandwidth optimally and objectively, they are computationally intensive, since they require repeated calculation of the KDE. A solution for objectively and optimally choosing both the kernel shape and width has recently been developed by Bernacchia and Pigolotti (2011). While this solution theoretically applies to multidimensional KDEs, it has not been clear how to practically do so. A method for practically extending the Bernacchia-Pigolotti KDE to multidimensions is introduced. This multidimensional extension is combined with a recently-developed computational improvement to their method that makes it computationally efficient: a 2D KDE on 105 samples only takes 1 s on a modern workstation. This fast and objective KDE method, called the fastKDE method, retains the excellent statistical convergence properties that have been demonstrated for univariate samples. The fastKDE method exhibits statistical accuracy that is comparable to state-of-the-science KDE methods publicly available in R, and it produces kernel density estimates several orders of magnitude faster. The fastKDE method does an excellent job of encoding covariance information for bivariate samples. This property allows for direct calculation of conditional PDFs with fastKDE. It is demonstrated how this capability might be leveraged for detecting non-trivial relationships between quantities in physical systems, such as transitional behavior.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1305435
Alternate ID(s):
OSTI ID: 1435070
Journal Information:
Computational Statistics and Data Analysis (Print), Journal Name: Computational Statistics and Data Analysis (Print) Vol. 101 Journal Issue: C; ISSN 0167-9473
Country of Publication:
Netherlands
Language:
English
Citation Metrics:
Cited by: 83 works
Citation information provided by
Web of Science

References (15)

A review of cloud top height and optical depth histograms from MISR, ISCCP, and MODIS journal January 2010
Self-consistent method for density estimation: Density Estimation journal April 2011
Cross-validation Bandwidth Matrices for Multivariate Kernel Density Estimation journal September 2005
Small-Scale and Mesoscale Variability in Cloudy Boundary Layers: Joint Probability Density Functions journal December 2002
‘All models are wrong...’: an introduction to model uncertainty journal July 2012
On dynamic and thermodynamic components of cloud changes journal March 2004
Transformations in Density Estimation journal June 1991
Reducing the computational cost of the ECF using a nuFFT: A fast and objective probability density estimation method journal November 2014
Simulation of the 1976/77 Climate Transition over the North Pacific: Sensitivity to Tropical Forcing journal December 2006
Stratiform Rain in the Tropics as Seen by the TRMM Precipitation Radar* journal June 2003
Global warming and changes in risk of concurrent climate extremes: Insights from the 2014 California drought: Global Warming and Concurrent Extremes journal December 2014
The World's Technological Capacity to Store, Communicate, and Compute Information journal February 2011
Bandwidth selection for kernel density estimation: a review of fully automatic selectors journal June 2013
Self-Consistent Density Estimation journal June 2014
Improvements to NOAA’s Historical Merged Land–Ocean Surface Temperature Analysis (1880–2006) journal May 2008