skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Extraction of information from unstructured text

Abstract

Extracting information from unstructured text has become an emphasis in recent years due to the large amount of text now electronically available. This status report describes the findings and work done by the end of the first year of a two-year LDRD. Requirements of the approach included that it model the information in a domain independent way. This means that it would differ from current systems by not relying on previously built domain knowledge and that it would do more than keyword identification. Three areas that are discussed and expected to contribute to a solution include (1) identifying key entities through document level profiling and preprocessing, (2) identifying relationships between entities through sentence level syntax, and (3) combining the first two with semantic knowledge about the terms.

Authors:
; ;
Publication Date:
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Org.:
USDOE, Washington, DC (United States)
OSTI Identifier:
148697
Report Number(s):
SAND-95-2532
ON: DE96003216
DOE Contract Number:  
AC04-94AL85000
Resource Type:
Technical Report
Resource Relation:
Other Information: PBD: Nov 1995
Country of Publication:
United States
Language:
English
Subject:
99 MATHEMATICS, COMPUTERS, INFORMATION SCIENCE, MANAGEMENT, LAW, MISCELLANEOUS; INFORMATION SYSTEMS; DATA BASE MANAGEMENT; INFORMATION THEORY; INFORMATION RETRIEVAL; DOCUMENT TYPES

Citation Formats

Irwin, N H, DeLand, S M, and Crowder, S V. Extraction of information from unstructured text. United States: N. p., 1995. Web. doi:10.2172/148697.
Irwin, N H, DeLand, S M, & Crowder, S V. Extraction of information from unstructured text. United States. https://doi.org/10.2172/148697
Irwin, N H, DeLand, S M, and Crowder, S V. 1995. "Extraction of information from unstructured text". United States. https://doi.org/10.2172/148697. https://www.osti.gov/servlets/purl/148697.
@article{osti_148697,
title = {Extraction of information from unstructured text},
author = {Irwin, N H and DeLand, S M and Crowder, S V},
abstractNote = {Extracting information from unstructured text has become an emphasis in recent years due to the large amount of text now electronically available. This status report describes the findings and work done by the end of the first year of a two-year LDRD. Requirements of the approach included that it model the information in a domain independent way. This means that it would differ from current systems by not relying on previously built domain knowledge and that it would do more than keyword identification. Three areas that are discussed and expected to contribute to a solution include (1) identifying key entities through document level profiling and preprocessing, (2) identifying relationships between entities through sentence level syntax, and (3) combining the first two with semantic knowledge about the terms.},
doi = {10.2172/148697},
url = {https://www.osti.gov/biblio/148697}, journal = {},
number = ,
volume = ,
place = {United States},
year = {Wed Nov 01 00:00:00 EST 1995},
month = {Wed Nov 01 00:00:00 EST 1995}
}