Using the triangle inequality to reduce the number of comparisons required for similarity-based retrieval
- Univ. of Virginia (United States)
- Univ. of Virginia, Charlottesville, VA (United States). Dept. of Computer Science
- Los Alamos National Lab., NM (United States)
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a similarity-based search as a nearest neighbor search. Though there has been extensive research on data structures and search methods to support nearest-neighbor searching, these indexing and dimension-reduction methods are generally not applicable to non-coordinate data and non-Euclidean distance measures. In this paper we reexamine and extend previous work of other researchers on best match searching based on the triangle inequality. These methods can be used to organize both non-coordinate data and non-Euclidean metric similarity measures. The effectiveness of the indexes depends on the actual dimensionality of the feature set, data, and similarity metric used. We show that these methods provide significant performance improvements and may be of practical value in real-world databases.
- Research Organization:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- W-7405-ENG-36
- OSTI ID:
- 208323
- Report Number(s):
- LA-UR-96-61; CONF-960171-1; ON: DE96007194
- Resource Relation:
- Conference: IS&T/SPIE symposium on electronic imaging: science & technology, Bellingham, WA (United States), 29 Jan - 2 Feb 1996; Other Information: PBD: [1996]
- Country of Publication:
- United States
- Language:
- English
Similar Records
A fingerprint based metric for measuring similarities of crystalline structures
MO-AB-BRA-10: Cancer Therapy Outcome Prediction Based On Dempster-Shafer Theory and PET Imaging