skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Computation of term dominance in text documents

Patent ·
OSTI ID:1040003

An improved entropy-based term dominance metric useful for characterizing a corpus of text documents, and is useful for comparing the term dominance metrics of a first corpus of documents to a second corpus having a different number of documents.

Research Organization:
Sandia National Laboratories (SNL), Albuquerque, NM, and Livermore, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC04-94AL85000
Assignee:
Sandia Corporation (Albuquerque, NM)
Patent Number(s):
8,166,051
Application Number:
12/364,753
OSTI ID:
1040003
Country of Publication:
United States
Language:
English

References (3)

Improving the retrieval of information from external sources journal June 1991
Term-weighting approaches in automatic text retrieval journal January 1988
New Term Weighting Formulas for the Vector Space Method in Information Retrieval report March 1999