Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
Colour text segmentation in web images based on human perception D. Karatzas a
 

Summary: Colour text segmentation in web images based on human perception
D. Karatzas a
, A. Antonacopoulos b,*
a
School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom
b
Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science and Engineering University of Salford,
Manchester M5 4WT, United Kingdom
Received 24 July 2004; received in revised form 14 April 2006; accepted 16 May 2006
Abstract
There is a significant need to extract and analyse the text in images on Web documents, for effective indexing, semantic analysis and
even presentation by non-visual means (e.g., audio). This paper argues that the challenging segmentation stage for such images benefits
from a human perspective of colour perception in preference to RGB colour space analysis. The proposed approach enables the segmen-
tation of text in complex situations such as in the presence of varying colour and texture (characters and background). More precisely,
characters are segmented as distinct regions with separate chromaticity and/or lightness by performing a layer decomposition of the
image. The method described here is a result of the authors' systematic approach to approximate the human colour perception charac-
teristics for the identification of character regions. In this instance, the image is decomposed by performing histogram analysis of Hue
and Lightness in the HLS colour space and merging using information on human discrimination of wavelength and luminance.
2006 Elsevier B.V. All rights reserved.
Keywords: Web document image analysis; Colour document analysis; Character segmentation; Text segmentation; Colour images

  

Source: Antonacopoulos, Apostolos - School of Computing, Science and Engineering, University of Salford

 

Collections: Computer Technologies and Information Sciences