DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Dynamic reduction of dimensions of a document vector in a document search and retrieval system

Abstract

The method and system of the invention involves processing each new document (20) coming into the system into a document vector (16), and creating a document vector with reduced dimensionality (17) for comparison with the data model (15) without recomputing the data model (15). These operations are carried out by a first computer (11) while a second computer (12) updates the data model (18), which can be comprised of an initial large group of documents (19) and is premised on the computing an initial data model (13, 14, 15) to provide a reference point for determining document vectors from documents processed from the data stream (20).

Inventors:
;
Issue Date:
Research Org.:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1176426
Patent Number(s):
7937389
Application Number:
12/072,723
Assignee:
UT-Battelle, LLC (Oak Ridge, TN)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
AC05-00OR22725
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Jiao, Yu, and Potok, Thomas E. Dynamic reduction of dimensions of a document vector in a document search and retrieval system. United States: N. p., 2011. Web.
Jiao, Yu, & Potok, Thomas E. Dynamic reduction of dimensions of a document vector in a document search and retrieval system. United States.
Jiao, Yu, and Potok, Thomas E. Tue . "Dynamic reduction of dimensions of a document vector in a document search and retrieval system". United States. https://www.osti.gov/servlets/purl/1176426.
@article{osti_1176426,
title = {Dynamic reduction of dimensions of a document vector in a document search and retrieval system},
author = {Jiao, Yu and Potok, Thomas E.},
abstractNote = {The method and system of the invention involves processing each new document (20) coming into the system into a document vector (16), and creating a document vector with reduced dimensionality (17) for comparison with the data model (15) without recomputing the data model (15). These operations are carried out by a first computer (11) while a second computer (12) updates the data model (18), which can be comprised of an initial large group of documents (19) and is premised on the computing an initial data model (13, 14, 15) to provide a reference point for determining document vectors from documents processed from the data stream (20).},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2011},
month = {5}
}