Summary: Proceedings of the 20th International Conference on Pattern Recognition (ICPR2010), Istanbul, Turkey, August
2326, 2010, IEEECS Press, pp. 257260.
1051-4651/10 $26.00 © 2010 IEEE 257
The PAGE (Page Analysis and Ground-truth Elements) Format Framework
S. Pletschacher and A. Antonacopoulos
Pattern Recognition and Image Analysis (PRImA) Research Lab
School of Computing, Science and Engineering, University of Salford, United Kingdom
This work has been supported in part through the EU 7th
Framework Programme grant IMPACT (Ref: 215064).
There is a plethora of established and proposed
document representation formats but none that can
adequately support individual stages within an entire
sequence of document image analysis methods (from
document image enhancement to layout analysis to
OCR) and their evaluation. This paper describes
PAGE, a new XML-based page image representation