DOE Patents, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Semiotic indexing of digital resources

Abstract

A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.
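The steps in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure, which entrywise combination, or which clustering algorithm is used, so the choices here are assumptions made only for illustration: cosine similarity, an entrywise (Hadamard) product, and a simple threshold-based grouping. The example documents, the two term sets, and all function names are hypothetical.

```python
import math
import re


def frequency_array(documents, terms):
    """Rows are documents, columns are terms; entries are occurrence counts."""
    rows = []
    for doc in documents:
        tokens = re.findall(r"[a-z]+", doc.lower())
        rows.append([tokens.count(term) for term in terms])
    return rows


def cosine_similarity_matrix(freq):
    """Pairwise cosine similarity between rows (assumed measure, not from the patent)."""
    norms = [math.sqrt(sum(x * x for x in row)) for row in freq]
    n = len(freq)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if norms[i] and norms[j]:
                dot = sum(a * b for a, b in zip(freq[i], freq[j]))
                sim[i][j] = dot / (norms[i] * norms[j])
    return sim


def entrywise_product(a, b):
    """Entrywise (Hadamard) product; one possible 'entrywise combination'."""
    return [[x * y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def cluster(sim, threshold):
    """Group documents linked by combined similarity >= threshold (connected components)."""
    n = len(sim)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and sim[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


# Hypothetical documents and term sets, chosen only to exercise the steps.
documents = [
    "Bacillus subtilis grows in soil and forms spores",
    "Bacillus cereus forms spores in soil",
    "Escherichia coli grows in the gut and ferments lactose",
    "Escherichia coli ferments lactose",
]
first_terms = ["bacillus", "escherichia"]
second_terms = ["soil", "spores", "gut", "lactose"]

first_sim = cosine_similarity_matrix(frequency_array(documents, first_terms))
second_sim = cosine_similarity_matrix(frequency_array(documents, second_terms))
combined = entrywise_product(first_sim, second_sim)
labels = cluster(combined, threshold=0.5)
print(labels)  # [0, 0, 1, 1]
```

With a product as the combination, two documents score high only when they are similar under both term sets, which is one plausible reason to combine the matrices entrywise rather than cluster on either alone.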

Inventors:
Parker, Charles T.; Garrity, George M.
Issue Date:
2014 Dec 02
Research Org.:
NamesforLife LLC, East Lansing, MI (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1164666
Patent Number(s):
8903825
Application Number:
13/478,973
Assignee:
NamesforLife LLC (East Lansing, MI)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
FG02-07ER86321; FG02-04ER63933
Resource Type:
Patent
Resource Relation:
Patent File Date: 2012 May 23
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS; 97 MATHEMATICS AND COMPUTING

Citation Formats

Parker, Charles T., and Garrity, George M. Semiotic indexing of digital resources. United States: N. p., 2014. Web.
Parker, Charles T., & Garrity, George M. (2014). Semiotic indexing of digital resources. United States.
Parker, Charles T., and Garrity, George M. 2014. "Semiotic indexing of digital resources". United States. https://www.osti.gov/servlets/purl/1164666.
@article{osti_1164666,
title = {Semiotic indexing of digital resources},
author = {Parker, Charles T. and Garrity, George M.},
abstractNote = {A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.},
place = {United States},
year = {2014},
month = {12}
}
