Semiotic indexing of digital resources
Abstract
A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.
- Inventors:
- Issue Date:
- Research Org.:
- NamesforLife LLC, East Lansing, MI (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1164666
- Patent Number(s):
- 8903825
- Application Number:
- 13/478,973
- Assignee:
- NamesforLife LLC (East Lansing, MI)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- FG02-07ER86321; FG02-04ER63933
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2012 May 23
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 99 GENERAL AND MISCELLANEOUS; 97 MATHEMATICS AND COMPUTING
Citation Formats
Parker, Charles T., and Garrity, George M. Semiotic indexing of digital resources. United States: N. p., 2014.
Web.
Parker, Charles T., & Garrity, George M. Semiotic indexing of digital resources. United States.
Parker, Charles T., and Garrity, George M. Tue .
"Semiotic indexing of digital resources". United States. https://www.osti.gov/servlets/purl/1164666.
@article{osti_1164666,
title = {Semiotic indexing of digital resources},
author = {Parker, Charles T. and Garrity, George M.},
abstractNote = {A method of classifying a plurality of documents. The method includes steps of providing a first set of classification terms and a second set of classification terms, the second set of classification terms being different from the first set of classification terms; generating a first frequency array of a number of occurrences of each term from the first set of classification terms in each document; generating a second frequency array of a number of occurrences of each term from the second set of classification terms in each document; generating a first similarity matrix from the first frequency array; generating a second similarity matrix from the second frequency array; determining an entrywise combination of the first similarity matrix and the second similarity matrix; and clustering the plurality of documents based on the result of the entrywise combination.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Tue Dec 02 00:00:00 EST 2014},
month = {Tue Dec 02 00:00:00 EST 2014}
}
Works referenced in this record:
Method of Peer Review of a Web-Based Encyclopedia
patent-application, August 2007
- Izhikevich, Eugene M.
- US Patent Application 11/619194; 20070180388
Method of rapidly screening X-ray powder diffraction patterns
patent, December 2001
- Murray, Richard; Bratu, Cheryl M.; Lewis, Gregory J.
- US Patent Document 6,327,334
Boosting to Determine Indicative Features from a Training Set
patent-application, December 2010
- Platt, John; Rook, Harvey; Yan, Shengquan
- US Patent Application 12/471841; 20100306147
Method and system for tracking the lifecycles of technology items
patent-application, May 2004
- Hicks, Jaye D.; Mears, Randall F.; Wacker, Jeffrey L.
- US Patent Application 10/294170; 20040098271
Expression Construct for Digesting Aggregating Protein and Method of Inhibiting the Aggregation of Aggregating Protein
patent-application, November 2010
- Yamada, Shin-ichi; Niwa, Jyun-ichi; Sobue, Gen
- US Patent Application 12/225311; 20100279402
System and method for the triage and classification of documents
patent-application, February 2008
- Kolo, Brian; Buhain, Ed; Koslow, Chad
- US Patent Application 11/892594; 20080052289
Managing taxonomic information
patent-application, September 2003
- Remsen, David Pratt; Norton, Catherine N.
- US Patent Application 10/087621; 20030167283
Insecticidal protein toxins from Photorhabdus
patent-application, November 2003
- Ensign, Jerald C.; Bowen, David J.; Petell, James
- US Patent Application 10/262794; 20030207806
Information Classifying Device, Information Classifying Method, Information Classifying Program, Information Classifying System
patent-application, May 2008
- Ihara, Masayoshi
- US Patent Application 11/791705; 20080114564
Systems and Methods for Automatically Identifying and Linking Names in Digital Resources
patent-application, August 2010
- Parker, Charles T.; Lyons, Catherine M.; Roston, Gerald P.
- US Patent Application 12/685964; 20100198841
Cross-Trace Scalable Issue Detection and Clustering
patent-application, June 2012
- Han, Shi; Dang, Yingnong; Hsiao, Shuo-Hsien
- US Patent Application 12/960015; 20120143795
Method of vector analysis for a document
patent, July 2009
- Kawatani, Takahiko
- US Patent Document 7,562,066
Method and apparatus for measuring similarity among electronic documents
patent, January 2006
- Palmer, Michael E.; Sun, Gordon; Zha, Hongyuan
- US Patent Document 6,990,628
High-Throughput Identification of Chemistry in Life Science Texts
book, January 2006
- Corbett, Peter; Murray-Rust, Peter; Hutchison, David
- Computational Life Sciences II, p. 107-118
Methods For Data Classification
patent-application, December 2009
- Garrity, George; Lilburn, Timothy G.
- US Patent Application 11/922273; 20090299926
Generation of materials with enhanced hydrogen content from microbial consortia including thermotoga
patent-application, October 2006
- Pfeiffer, Robert S.; Ulrich, Glenn A.; Vanzin, Gary
- US Patent Application 11/099880; 20060223159
Taxonomy generation for electronic documents
patent, July 2007
- Woehler, Johannes; Faerber, Franz
- US Patent Document 7,243,092
Method of Analyzing Documenta
patent-application, February 2009
- Handley, John C.
- US Patent Application 12/241903; 20090037390
Using Categorical Metadata to Rank Search Results
patent-application, February 2011
- Svore, Krysta Marie; Bennett, Paul Nathan; Dumais, Susan T.
- US Patent Application 12/541166; 20110040752
Systems and methods for resolving ambiguity between names and entities
patent-application, July 2005
- Garrity, George; Lyons, Catherine
- US Patent Application 10/759817; 20050160059
Method and System for Failure Signal Detention Analysis
patent-application, December 2007
- Burch, Richard; Lin, Paul; Graves, Spencer
- US Patent Application 10/596960; 20070288185
Method and computer-based sytem for non-probabilistic hypothesis generation and verification
patent-application, May 2004
- Andreev, Leonid; Andreev, Dmitry
- US Patent Application 10/716325; 20040103108
Computer systems and methods for visualizing data with generation of marks
patent-application, September 2006
- Hanrahan, Patrick; Stolte, Chris
- US Patent Application 11/005652; 20060206512
Clustering Using Non-Negative Matrix Factorization on Sparse Graphs
patent-application, October 2009
- Perronnin, Florent; Bouchard, Guillaume
- US Patent Application 12/109496; 20090271433
Methods and Materials for Canine Breed Identification
patent-application, September 2011
- Ostrander, Elaine; Kruglyak, Leonid; Parker, Heidi G.
- US Patent Application 13/039240; 20110224911
Method and Apparatus for User Modelization
patent-application, December 2011
- Donneau-Golencer, Thierry; Hardt, Stephen L.
- US Patent Application 13/149536; 20110295612
System and method for implementing a knowledge management system
patent-application, February 2004
- Copperman, Max; Angel, Mark; Rudy, Jeffrey H.
- US Patent Application 10/610994; 20040024739
Method and system for data segmentation
patent-application, May 2005
- Lakshminarayan, Choudur K.; Singh, Pramond; Yu, Qingfeng
- US Patent Application 10/871148; 20050114382
Chart display device and program for the same
patent-application, March 2007
- Shinohara, Noboru; Doi, Naohito; Tsuchida, Kazuhiro
- US Patent Application 11/332282; 20070046672
System and method for searching and processing databases comprising named annotated text strings
patent, June 2001
- Macke, Thomas J.; Butler, Bill F.; O'Connell, James P.
- US Patent Document 6,249,784
Dynamic document icons
patent-application, June 2006
- Berkner, Kathrin
- US Patent Application 11/019802; 20060136478
Technical classification method for searching patents
patent-application, September 2008
- Huang, Chung-Jen; Hrong, Alex
- US Patent Application 11/797053; 20080228724
Method and system for automatic comparison of text classifications
patent, May 2002
- Kreulen, Jeffrey Thomas; Spangler, William Scott; Strong, Jr., Hovey R.
- US Patent Document 6,397,215
Method for obtaining consensus classifications and identifications by combining data from different experiments
patent-application, January 2005
- Vauterin, Luc R.G.; Vauterin, Paul J.J.
- US Patent Application 10/758249; 20050014195
Browsable database for biological use
patent-application, July 2005
- Thomas, Paul D.; Kejariwal, Anish; Champbell, Michael J.
- US Patent Application 10/731875; 20050149269
Information system for healthcare and biology
patent-application, March 2011
- Goldsmith, Brian J.; Maloney, Alan G.; Kelly, Paul
- US Patent Application 12/589806; 20110066968
Optimal dissimilarity method for choosing distinctive items of information from a large body of information
patent, March 2003
- Clark, Robert D.
- US Patent Document 6,535,819
Systems and methods for resolving ambiguity between names and entities
patent, April 2011
- Garrity, George M.; Lyons, Catherine M.
- US Patent Document 7,925,444
Term-level text with mining with taxonomies
patent, August 2002
- Feldman, Ronen; Aumann, Yehonatan; Schler, Jonathan
- US Patent Document 6,442,545
System with user directed enrichment and import/export control
patent-application, April 2006
- Hubert, Laurence; Guerin, Nicolas
- US Patent Application 11/284878; 20060080314
System and Method for Recommending Educational Resources
patent-application, June 2010
- German, Kristine A.; Lofthus, Robert M.; Price, Robert Roy
- US Patent Application 12/340116; 20100159438
Information data retrieval, where the data is organized in terms, documents and document corpora
patent-application, July 2005
- Lindh, Per; Londahl, Bjorn
- US Patent Application 10/501397; 20050149494
A combining approach to find all taxon names (FAT)
journal, June 2006
- Sautter, Guido; Böhm, Klemens; Agosti, Donat
- Biodiversity Informatics, Vol. 3, Issue 0