Performance evaluation of two OCR systems
Technical Report · OSTI ID: 68585
- Univ. of Washington, Seattle, WA (United States)
- Seattle Univ., Seattle, WA (United States)
An experimental protocol for the performance evaluation of Optical Character Recognition (OCR) algorithms is described. The protocol is intended to serve as a model for using the University of Washington English Document Image Database-I to evaluate OCR systems. The plain text zones (without special symbols) in this database have over 2,300,000 characters. The performances of two UNIX-based OCR systems, namely Caere OCR v109a and Xerox ScanWorX v2.0, are measured. The results suggest that Caere OCR outperforms ScanWorX in terms of recognition accuracy; however, ScanWorX is more robust in the presence of image flaws.
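The abstract does not state how recognition accuracy was computed. As a minimal sketch, assuming the common convention of character accuracy defined as (n - errors) / n, where n is the number of ground-truth characters and errors is the edit distance between the ground truth and the OCR output, the measurement could look like the following. The function names `edit_distance` and `character_accuracy` are hypothetical and do not come from the report.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning ref into hyp."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from ref
                            curr[j - 1] + 1,      # insertion into ref
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_accuracy(ground_truth: str, ocr_output: str) -> float:
    """Assumed metric (not taken from the report): (n - errors) / n."""
    if not ground_truth:
        raise ValueError("ground truth must be non-empty")
    errors = edit_distance(ground_truth, ocr_output)
    return (len(ground_truth) - errors) / len(ground_truth)


if __name__ == "__main__":
    # Illustrative strings only; these are not data from the report.
    print(character_accuracy("Optical Character", "0ptical Charactcr"))
```

Under this definition the value can drop below zero when the OCR output contains many spurious insertions, so per-zone scores are usually aggregated by summing errors and character counts rather than by averaging ratios.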
- Research Organization: Nevada Univ., Las Vegas, NV (United States)
- OSTI ID: 68585
- Report Number(s): CONF-9404212--
- Country of Publication: United States
- Language: English
Similar Records
An evaluation of information retrieval accuracy with simulated OCR output
Technical Report · Fri Dec 30 23:00:00 EST 1994 · OSTI ID: 68569
Prediction of OCR accuracy using simple image features
Technical Report · Fri Mar 31 23:00:00 EST 1995 · OSTI ID: 46719
Adaptive image enhancement of text images that contain touching or broken characters
Technical Report · Mon Nov 28 23:00:00 EST 1994 · OSTI ID: 42491