OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: A High Accuracy Method for Semi-supervised Information Extraction

Abstract

Customization to specific domains of discourse and/or user requirements is one of the greatest challenges for today’s Information Extraction (IE) systems. While demonstrably effective, both rule-based and supervised machine learning approaches to IE customization place too high a burden on the user. Semi-supervised learning approaches may in principle offer a more resource-effective solution but are still insufficiently accurate to warrant realistic application. We demonstrate that this limitation can be overcome by integrating fully supervised learning techniques within a semi-supervised IE approach, without increasing resource requirements.

Authors:
Tratz, Stephen C.; Sanfilippo, Antonio P.
Publication Date:
2007-04-22
Research Org.:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
908955
Report Number(s):
PNNL-SA-53858
400904120; TRN: US200722%%831
DOE Contract Number:  
AC05-76RL01830
Resource Type:
Conference
Resource Relation:
Conference: Proceedings of Human Language Technologies: The Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), 169-172
Country of Publication:
United States
Language:
English
Subject:
99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE; ACCURACY; LEARNING; INFORMATION RETRIEVAL; INFORMATION SYSTEMS; information extraction; machine learning; unsupervised learning; computational linguistics

Citation Formats

Tratz, Stephen C., and Sanfilippo, Antonio P. A High Accuracy Method for Semi-supervised Information Extraction. United States: N. p., 2007. Web.
Tratz, Stephen C., & Sanfilippo, Antonio P. (2007). A High Accuracy Method for Semi-supervised Information Extraction. United States.
Tratz, Stephen C., and Sanfilippo, Antonio P. 2007. "A High Accuracy Method for Semi-supervised Information Extraction". United States. https://www.osti.gov/servlets/purl/908955.
@article{osti_908955,
title = {A High Accuracy Method for Semi-supervised Information Extraction},
author = {Tratz, Stephen C. and Sanfilippo, Antonio P.},
abstractNote = {Customization to specific domains of discourse and/or user requirements is one of the greatest challenges for today’s Information Extraction (IE) systems. While demonstrably effective, both rule-based and supervised machine learning approaches to IE customization place too high a burden on the user. Semi-supervised learning approaches may in principle offer a more resource-effective solution but are still insufficiently accurate to warrant realistic application. We demonstrate that this limitation can be overcome by integrating fully supervised learning techniques within a semi-supervised IE approach, without increasing resource requirements.},
place = {United States},
year = {2007},
month = {4}
}

