Summary: Accounting for Boundary Effects in Nearest Neighbor
Sunil Arya
David M. Mount
Onuttom Narayan
Given n data points in d-dimensional space, nearest neighbor searching involves
determining the nearest of these data points to a given query point. Most average-
case analyses of nearest neighbor searching algorithms are made under the simplifying
assumption that d is fixed and that n is so large relative to d that boundary effects can
be ignored. This means that for any query point the statistical distribution of the data
points surrounding it is independent of the location of the query point. However, in
many applications of nearest neighbor searching (such as data compression by vector
quantization) this assumption is not met, since the number of data points n grows
roughly as 2^d. Largely for this reason, the actual performances of many nearest neighbor
algorithms tend to be much better than their theoretical analyses would suggest. We
present evidence of why this is the case. We provide an accurate analysis of the number
of cells visited in nearest neighbor searching by the bucketing and k-d tree algorithms.
We assume md
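
For orientation, the problem defined at the start of the summary (find the data point nearest to a query point) can be sketched as a brute-force scan. This is only an illustrative baseline, not the bucketing or k-d tree algorithms analyzed in the paper. The function name nearest_neighbor, the Euclidean metric, and the sample points are assumptions made for the example.

```python
import math


def nearest_neighbor(data, query):
    """Return (index, distance) of the point in data closest to query.

    Assumes Euclidean distance and points given as equal-length tuples;
    both are illustrative choices, not taken from the paper.
    """
    if not data:
        raise ValueError("data must contain at least one point")
    best_index, best_dist = -1, math.inf
    # Linear scan: compare the query against every data point.
    for i, point in enumerate(data):
        dist = math.dist(point, query)
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index, best_dist


# Hypothetical sample data: three points in 2-dimensional space.
points = [(0.0, 0.0), (1.0, 1.0), (0.2, 0.9)]
index, distance = nearest_neighbor(points, (0.0, 1.0))
```

The scan examines all n points for every query. Bucketing and k-d trees aim to visit only a few cells instead, and the number of cells visited is the quantity the paper analyzes.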


Source: Arya, Sunil - Department of Computer Science, Hong Kong University of Science and Technology


Collections: Computer Technologies and Information Sciences