A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
 

A. Antonacopoulos and D. Karatzas
Pattern Recognition and Image Analysis (PRImA) group, Department of Computer Science
University of Liverpool, Liverpool, L69 3BX, United Kingdom
http://www.csc.liv.ac.uk/~prima
Abstract. This paper presents a complete system that historians/archivists can
use to digitize whole collections of documents relating to personal information.
The system integrates tools and processes that facilitate scanning, image
indexing, document (physical and logical) structure definition, document image
analysis, recognition, proofreading/correction and semantic tagging. The system
is described in the context of different types of typewritten documents relating
to prisoners in World War II concentration camps and is the result of a
multinational collaboration under the MEMORIAL project funded (€1.5M) by
the European Union (www.memorial-project.info). Results on a representative
selection of documents show a significant improvement not only in terms of
OCR accuracy but also in terms of overall time/cost involved in converting
these documents for digital archives.
1 Introduction
The problem of converting collections of documents into digital archives or libraries

  

Source: Antonacopoulos, Apostolos - School of Computing, Science and Engineering, University of Salford

 

Collections: Computer Technologies and Information Sciences