DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: System for information discovery

Abstract

A sequence of word filters are used to eliminate terms in the database which do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two dimensional matrix with matrix entries calculated as the conditional probability that a document will contain a word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.

Inventors:
 [1];  [2]
  1. Richland, WA
  2. Kennewick, WA
Issue Date:
Research Org.:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
874890
Patent Number(s):
6484168
Application Number:
09/455849
Assignee:
Battelle Memorial Institute (Richland, WA)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
Y - NEW / CROSS SECTIONAL TECHNOLOGIES Y10 - TECHNICAL SUBJECTS COVERED BY FORMER USPC Y10S - TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
DOE Contract Number:  
AC06-76RL01830
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
information; discovery; sequence; word; filters; eliminate; terms; database; discriminate; document; content; resulting; filtered; set; topic; highly; predictive; sets; formed; dimensional; matrix; entries; calculated; conditional; probability; contain; row; contains; column; representation; allows; resultant; vectors; utilized; interpret; contents; /707/

Citation Formats

Pennock, Kelly A, and Miller, Nancy E. System for information discovery. United States: N. p., 2002. Web.
Pennock, Kelly A, & Miller, Nancy E. System for information discovery. United States.
Pennock, Kelly A, and Miller, Nancy E. Tue . "System for information discovery". United States. https://www.osti.gov/servlets/purl/874890.
@article{osti_874890,
title = {System for information discovery},
author = {Pennock, Kelly A and Miller, Nancy E},
abstractNote = {A sequence of word filters are used to eliminate terms in the database which do not discriminate document content, resulting in a filtered word set and a topic word set whose members are highly predictive of content. These two word sets are then formed into a two dimensional matrix with matrix entries calculated as the conditional probability that a document will contain a word in a row given that it contains the word in a column. The matrix representation allows the resultant vectors to be utilized to interpret document contents.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2002},
month = {11}
}

Works referenced in this record:

Results on lattice vector quantization with dithering
journal, January 1996

  • Kirac, A.; Vaidyanathan, P. P.
  • IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, Vol. 43, Issue 12
  • https://doi.org/10.1109/82.553397