| | |
Summary: The Lifecycle of a Digital Historical Document: Structure
and Content
A. Antonacopoulos, D. Karatzas
Department of Computer Science
University of Liverpool
Liverpool, United Kingdom
http://www.csc.liv.ac.uk/~prima
H. Krawczyk, B. Wiszniewski
Faculty of Electronics, Telecomm/s and Informatics
Technical University of Gdask
Gdask, Poland
http://www.eti.pg.gda.pl
ABSTRACT
This paper describes the lifecycle of a digital historical document,
from template-based structure definition through to content
extraction from the scanned pages and its final reconstitution as
an electronic document (combining content and semantic
information) along with the tools that have been created to realise
each stage in the lifecycle. The whole approach is described in the
context of different types of typewritten documents relating to
|