skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Semiotic indexing of digital resources

Patent ·
OSTI ID:1164666

A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.

Research Organization:
NamesforLife LLC, East Lansing, MI (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
FG02-07ER86321; FG02-04ER63933
Assignee:
NamesforLife LLC (East Lansing, MI)
Patent Number(s):
8,903,825
Application Number:
13/478,973
OSTI ID:
1164666
Resource Relation:
Patent File Date: 2012 May 23
Country of Publication:
United States
Language:
English

References (43)

Method of Peer Review of a Web-Based Encyclopedia patent-application August 2007
Method of rapidly screening X-ray powder diffraction patterns patent December 2001
Boosting to Determine Indicative Features from a Training Set patent-application December 2010
Method and system for tracking the lifecycles of technology items patent-application May 2004
Expression Construct for Digesting Aggregating Protein and Method of Inhibiting the Aggregation of Aggregating Protein patent-application November 2010
System and method for the triage and classification of documents patent-application February 2008
Managing taxonomic information patent-application September 2003
Insecticidal protein toxins from Photorhabdus patent-application November 2003
Information Classifying Device, Information Classifying Method, Information Classifying Program, Information Classifying System patent-application May 2008
Systems and Methods for Automatically Identifying and Linking Names in Digital Resources patent-application August 2010
Cross-Trace Scalable Issue Detection and Clustering patent-application June 2012
Method of vector analysis for a document patent July 2009
Method and apparatus for measuring similarity among electronic documents patent January 2006
High-Throughput Identification of Chemistry in Life Science Texts book January 2006
Methods For Data Classification patent-application December 2009
Generation of materials with enhanced hydrogen content from microbial consortia including thermotoga patent-application October 2006
Taxonomy generation for electronic documents patent July 2007
Method of Analyzing Documenta patent-application February 2009
Using Categorical Metadata to Rank Search Results patent-application February 2011
Systems and methods for resolving ambiguity between names and entities patent-application July 2005
Method and System for Failure Signal Detention Analysis patent-application December 2007
Method and computer-based sytem for non-probabilistic hypothesis generation and verification patent-application May 2004
Computer systems and methods for visualizing data with generation of marks patent-application September 2006
Clustering Using Non-Negative Matrix Factorization on Sparse Graphs patent-application October 2009
Methods and Materials for Canine Breed Identification patent-application September 2011
Method and Apparatus for User Modelization patent-application December 2011
System and method for implementing a knowledge management system patent-application February 2004
Method and system for data segmentation patent-application May 2005
Chart display device and program for the same patent-application March 2007
System and method for searching and processing databases comprising named annotated text strings patent June 2001
Dynamic document icons patent-application June 2006
Technical classification method for searching patents patent-application September 2008
Method and system for automatic comparison of text classifications patent May 2002
Method for obtaining consensus classifications and identifications by combining data from different experiments patent-application January 2005
Browsable database for biological use patent-application July 2005
Information system for healthcare and biology patent-application March 2011
Optimal dissimilarity method for choosing distinctive items of information from a large body of information patent March 2003
Systems and methods for resolving ambiguity between names and entities patent April 2011
Term-level text with mining with taxonomies patent August 2002
System with user directed enrichment and import/export control patent-application April 2006
System and Method for Recommending Educational Resources patent-application June 2010
Information data retrieval, where the data is organized in terms, documents and document corpora patent-application July 2005
A combining approach to find all taxon names (FAT) journal June 2006