System for information discovery
Abstract
A sequence of word filters are used to eliminate terms in the database which do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two dimensional matrix with matrix entries calculated as the conditional probability that a document will contain a word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.
- Inventors:
-
- Richland, WA
- Kennewick, WA
- Issue Date:
- Research Org.:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 874890
- Patent Number(s):
- 6484168
- Application Number:
- 09/455849
- Assignee:
- Battelle Memorial Institute (Richland, WA)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
Y - NEW / CROSS SECTIONAL TECHNOLOGIES Y10 - TECHNICAL SUBJECTS COVERED BY FORMER USPC Y10S - TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- DOE Contract Number:
- AC06-76RL01830
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- information; discovery; sequence; word; filters; eliminate; terms; database; discriminate; document; content; resulting; filtered; set; topic; highly; predictive; sets; formed; dimensional; matrix; entries; calculated; conditional; probability; contain; row; contains; column; representation; allows; resultant; vectors; utilized; interpret; contents; /707/
Citation Formats
Pennock, Kelly A, and Miller, Nancy E. System for information discovery. United States: N. p., 2002.
Web.
Pennock, Kelly A, & Miller, Nancy E. System for information discovery. United States.
Pennock, Kelly A, and Miller, Nancy E. Tue .
"System for information discovery". United States. https://www.osti.gov/servlets/purl/874890.
@article{osti_874890,
title = {System for information discovery},
author = {Pennock, Kelly A and Miller, Nancy E},
abstractNote = {A sequence of word filters are used to eliminate terms in the database which do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two dimensional matrix with matrix entries calculated as the conditional probability that a document will contain a word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2002},
month = {11}
}
Works referenced in this record:
Results on lattice vector quantization with dithering
journal, January 1996
- Kirac, A.; Vaidyanathan, P. P.
- IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, Vol. 43, Issue 12