OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture

Patent · OSTI ID: 979381

Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture are described. In one aspect, a document clustering method includes providing a document set comprising a plurality of documents, providing a cluster comprising a subset of the documents of the document set, using a plurality of terms of the documents, providing a cluster label indicative of subject matter content of the documents of the cluster, wherein the cluster label comprises a plurality of word senses, and selecting one of the word senses of the cluster label.
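The abstract's final step, selecting one word sense for a cluster label, can be illustrated with a minimal sketch. This is not the patented method: it assumes a simple overlap score (in the spirit of Lesk-style disambiguation) between each candidate sense's gloss words and the terms of the documents in the cluster. The sense inventory `SENSES`, the helper names, and the sample documents are all hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical toy sense inventory (an assumption, not from the patent):
# label term -> {sense name: set of gloss words describing that sense}.
SENSES = {
    "bank": {
        "financial_institution": {"money", "loan", "deposit", "account", "credit"},
        "river_edge": {"river", "water", "shore", "erosion", "sediment"},
    }
}


def cluster_term_counts(documents):
    """Count whitespace-separated, lowercased terms across the cluster's documents."""
    counts = Counter()
    for doc in documents:
        counts.update(doc.lower().split())
    return counts


def select_sense(label, documents, senses=SENSES):
    """Score each sense of the label by gloss overlap with the cluster's terms.

    Returns the highest-scoring sense name and the full score table.
    """
    context = cluster_term_counts(documents)
    scores = {
        sense: sum(context[word] for word in gloss)
        for sense, gloss in senses[label].items()
    }
    return max(scores, key=scores.get), scores


# A cluster (subset of a document set) whose label "bank" is ambiguous.
cluster = [
    "the bank approved the loan and opened a deposit account",
    "credit terms at the bank depend on the account history",
]
best, scores = select_sense("bank", cluster)
print(best, scores)
```

With these sample documents, the financial sense is selected because its gloss words occur in the cluster while the river sense's do not. A real system would likely draw senses from a lexical resource and use a more robust similarity measure than raw term overlap.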

Research Organization: Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization: USDOE
DOE Contract Number: AC06-76RL01830
Assignee: Battelle Memorial Research (Richland, WA)
Patent Number(s): 7,636,730
Application Number: 11/118,156
OSTI ID: 979381
Country of Publication: United States
Language: English
