Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Machine Learning to Improve Retrieval by Category in Big Volunteered Geodata

Conference ·
Nowadays, Volunteered Geographic Information (VGI) is commonly used in research and practical applications. However, the quality assurance of such a geographic data remains a problem. In this study we use machine learning and natural language processing to improve record retrieval by category (e.g. restaurant, museum, etc.) from Wikimapia Points of Interest data. We use textual information contained in VGI records to evaluate its ability to determine the category label. The performance of the trained classifier is evaluated on the complete dataset and then is compared with its performance on regional subsets. Preliminary analysis shows significant difference in the classifier performance across the regions. Such geographic differences will have a significant effect on data enrichment efforts such as labeling entities with missing categories.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1490594
Country of Publication:
United States
Language:
English

References (4)

Assuring the quality of volunteered geographic information journal May 2012
Introduction to Information Retrieval book January 2008
A review of volunteered geographic information quality assessment methods journal May 2016
Ambiguity and plausibility
  • Ali, Ahmed Loai; Schmid, Falko; Al-Salman, Rami
  • Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems https://doi.org/10.1145/2666310.2666392
conference November 2014

Similar Records

SONET: a semantic ontological network graph for managing points of interest data heterogeneity
Conference · Fri Nov 01 00:00:00 EDT 2019 · OSTI ID:1606916

Exploiting deep learning and volunteered geographic information for mapping buildings in Kano, Nigeria
Journal Article · Mon Oct 22 20:00:00 EDT 2018 · Scientific Data · OSTI ID:1607293

Redistribution of visceral blood volume in upright exercise in healthy volunteers
Journal Article · Sat Oct 01 00:00:00 EDT 1988 · J. Nucl. Med.; (United States) · OSTI ID:6558093

Related Subjects