| | |
Summary: Extracting Relations from Large Text Collections
Yevgeny (Eugene) Agichtein
Submitted in partial fulfillment of the
requirements for the degree
of Doctor of Philosophy
in the Graduate School of Arts and Sciences
COLUMBIA UNIVERSITY
2005
c 2005
Yevgeny (Eugene) Agichtein
All Rights Reserved
ABSTRACT
Extracting Relations from Large Text Collections
Yevgeny (Eugene) Agichtein
Advisor: Professor Luis Gravano
A wealth of information is hidden within unstructured text. Often, this information can
be best exploited in structured or relational form, which is well suited for sophisticated
query processing, for integration with relational database management systems, and for
data mining. This thesis addresses two fundamental problems in extracting relations from
large text collections: (1) portability: tuning extraction systems for new domains and (2)
|