Dynamic reduction of dimensions of a document vector in a document search and retrieval system
Abstract
The method and system of the invention involves processing each new document (20) coming into the system into a document vector (16), and creating a document vector with reduced dimensionality (17) for comparison with the data model (15) without recomputing the data model (15). These operations are carried out by a first computer (11) while a second computer (12) updates the data model (18), which can be comprised of an initial large group of documents (19) and is premised on the computing an initial data model (13, 14, 15) to provide a reference point for determining document vectors from documents processed from the data stream (20).
- Inventors:
- Issue Date:
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1176426
- Patent Number(s):
- 7937389
- Application Number:
- 12/072,723
- Assignee:
- UT-Battelle, LLC (Oak Ridge, TN)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC05-00OR22725
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Jiao, Yu, and Potok, Thomas E. Dynamic reduction of dimensions of a document vector in a document search and retrieval system. United States: N. p., 2011.
Web.
Jiao, Yu, & Potok, Thomas E. Dynamic reduction of dimensions of a document vector in a document search and retrieval system. United States.
Jiao, Yu, and Potok, Thomas E. Tue .
"Dynamic reduction of dimensions of a document vector in a document search and retrieval system". United States. https://www.osti.gov/servlets/purl/1176426.
@article{osti_1176426,
title = {Dynamic reduction of dimensions of a document vector in a document search and retrieval system},
author = {Jiao, Yu and Potok, Thomas E.},
abstractNote = {The method and system of the invention involves processing each new document (20) coming into the system into a document vector (16), and creating a document vector with reduced dimensionality (17) for comparison with the data model (15) without recomputing the data model (15). These operations are carried out by a first computer (11) while a second computer (12) updates the data model (18), which can be comprised of an initial large group of documents (19) and is premised on the computing an initial data model (13, 14, 15) to provide a reference point for determining document vectors from documents processed from the data stream (20).},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2011},
month = {5}
}