Summary: Assembling Documents from Digital Libraries
Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, and Pekka Kilpeläinen
University of Helsinki, Department of Computer Science
P. O. Box 26, FIN–00014 University of Helsinki, Finland
Abstract. We consider assembling documents using, as a source, a digital
library containing SGML documents. The assembly process contains
two parts: 1) finding interesting fragments, and 2) constructing a coherent
document. We present a general document assembly framework.
First, we describe a system for tailoring control engineering textbooks.
Its assembling facilities are rather restricted but, on the other hand, the
quality of documents produced is high. Second, we address the problem
of filtering and combining interesting information from a large heterogeneous
document collection. The methods presented offer various ways to
find the interesting document fragments. Moreover, the elements found
in the fragments are mapped to generic elements, like sections, paragraph
containers, paragraphs and strings, which have known semantics. Hence,
even arbitrary compositions can be formatted and printed.
1 Introduction
In an ongoing research and development project called Structured and Intelligent
Documents (SID) [1] we study document assembly in its different aspects.