skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Rapid Exploitation and Analysis of Documents

Technical Report ·
DOI:https://doi.org/10.2172/1033748· OSTI ID:1033748

Analysts are overwhelmed with information. They have large archives of historical data, both structured and unstructured, and continuous streams of relevant messages and documents that they need to match to current tasks, digest, and incorporate into their analysis. The purpose of the READ project is to develop technologies to make it easier to catalog, classify, and locate relevant information. We approached this task from multiple angles. First, we tackle the issue of processing large quantities of information in reasonable time. Second, we provide mechanisms that allow users to customize their queries based on latent topics exposed from corpus statistics. Third, we assist users in organizing query results, adding localized expert structure over results. Forth, we use word sense disambiguation techniques to increase the precision of matching user generated keyword lists with terms and concepts in the corpus. Fifth, we enhance co-occurrence statistics with latent topic attribution, to aid entity relationship discovery. Finally we quantitatively analyze the quality of three popular latent modeling techniques to examine under which circumstances each is useful.

Research Organization:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
W-7405-ENG-48
OSTI ID:
1033748
Report Number(s):
LLNL-TR-517731; TRN: US201203%%227
Country of Publication:
United States
Language:
English

Similar Records

Novel Analysis and Visualization of Chemical Events for Public Health Surveillance
Journal Article · Tue May 02 00:00:00 EDT 2017 · Online Journal of Public Health Informatics · OSTI ID:1033748

Finding Text Information in the Ocean of Electronic Documents
Conference · Wed Feb 05 00:00:00 EST 2003 · OSTI ID:1033748

VisIRR: A Visual Analytics System for Information Retrieval and Recommendation for Large-Scale Document Data
Journal Article · Wed Jan 31 00:00:00 EST 2018 · ACM Transactions on Knowledge Discovery from Data · OSTI ID:1033748