
Title: Semiotic indexing of digital resources

A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.
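The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say which similarity measure, which entrywise combination, or which clustering algorithm is used, so the sketch assumes cosine similarity, an entrywise (Hadamard) product, and a simple threshold-based grouping. All function names, the single-word tokenization, and the `threshold` parameter are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def frequency_array(documents, terms):
    """One row per document, one column per term: raw occurrence counts.
    Simplification: only single-word, lowercase terms are matched."""
    rows = []
    for text in documents:
        counts = Counter(re.findall(r"[a-z]+", text.lower()))
        rows.append([counts[t] for t in terms])
    return rows


def cosine_similarity_matrix(freq):
    """Pairwise cosine similarity between document rows (assumed measure)."""
    norms = [math.sqrt(sum(v * v for v in row)) for row in freq]
    n = len(freq)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if norms[i] and norms[j]:
                dot = sum(a * b for a, b in zip(freq[i], freq[j]))
                sim[i][j] = dot / (norms[i] * norms[j])
    return sim


def entrywise_product(a, b):
    """Entrywise combination, assumed here to be the Hadamard product."""
    return [[x * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def cluster(sim, threshold):
    """Assumed clustering step: documents are grouped when they are linked,
    directly or through other documents, by similarity >= threshold."""
    n = len(sim)
    labels = [-1] * n
    next_label = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = next_label
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and sim[i][j] >= threshold:
                    labels[j] = next_label
                    stack.append(j)
        next_label += 1
    return labels


def classify(documents, terms_a, terms_b, threshold=0.5):
    """Run the full pipeline with two different term sets."""
    sim_a = cosine_similarity_matrix(frequency_array(documents, terms_a))
    sim_b = cosine_similarity_matrix(frequency_array(documents, terms_b))
    return cluster(entrywise_product(sim_a, sim_b), threshold)
```

With the product as the combination, two documents end up in the same cluster only if they are similar under both term sets, which is one plausible reading of why two different sets of classification terms are used.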
Inventors:
Issue Date:
OSTI Identifier:
1164666
Assignee:
NamesforLife LLC (East Lansing, MI) CHO
Patent Number(s):
8,903,825
Application Number:
13/478,973
Contract Number:
FG02-07ER86321
Resource Relation:
Patent File Date: 2012 May 23
Research Org:
NamesforLife LLC, East Lansing, MI (United States)
Sponsoring Org:
USDOE
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS; 97 MATHEMATICS AND COMPUTING

Works referenced in this record:

High-Throughput Identification of Chemistry in Life Science Texts
book, January 2006
  • Corbett, Peter; Murray-Rust, Peter; Hutchison, David
  • Computational Life Sciences II, p. 107-118
  • DOI: 10.1007/11875741_11