Automatically generating extraction patterns from untagged text

Riloff, E

Conference · Tue Dec 31 1996

OSTI ID: 430781

Univ. of Utah, Salt Lake City, UT (United States)

Many corpus-based natural language processing systems rely on text corpora that have been manually annotated with syntactic or semantic tags. In particular, all previous dictionary construction systems for information extraction have used an annotated training corpus or some form of annotated input. We have developed a system called AutoSlog-TS that creates dictionaries of extraction patterns using only untagged text. AutoSlog-TS is based on the AutoSlog system, which generated extraction patterns using annotated text and a set of heuristic rules. By adapting AutoSlog and combining it with statistical techniques, we eliminated its dependency on tagged text. In experiments with the MUC-4 terrorism domain, AutoSlog-TS, using only preclassified texts as input, created a dictionary of extraction patterns that performed comparably to a dictionary created by AutoSlog.

Report Number(s): CONF-960876-; CNN: Grant MIP 9023174; TRN: 96:006521-0156

Resource Relation: Conference: 13. National conference on artificial intelligence and the 8. Innovative applications of artificial intelligence conference, Portland, OR (United States), 4-8 Aug 1996; Other Information: PBD: 1996; Related Information: Is Part Of Proceedings of the thirteenth national conference on artificial intelligence and the eighth innovative applications of artificial intelligence conference. Volume 1 and 2; PB: 1626 p.

Country of Publication: United States

Language: English

Similar Records

Information Extraction from Unstructured Text for the Biodefense Knowledge Center

Conference · Fri Apr 29 2005

Samatova, N F; Park, B; Krishnamurthy, R; +6 more

Literature mining of protein-residue associations with graph rules learned through distant supervision

Journal Article · Fri Oct 05 2012 · Journal of Biomedical Semantics

Ravikumar, K. E.; Liu, Haibin; Cohn, Judith D.; +2 more

Experiments in automatic word class and word sense identification for information retrieval

Technical Report · Sat Dec 31 1994

Gauch, S; Futrelle, R P

Related Subjects

99 MATHEMATICS, COMPUTERS, INFORMATION SCIENCE, MANAGEMENT, LAW, MISCELLANEOUS
ARTIFICIAL INTELLIGENCE
NATURAL LANGUAGE
ALGORITHMS
LEARNING
