Text Segmentation in Colour Posters from the Spanish Civil War Era

Antonio Clavelli, Dimosthenis Karatzas
2009 2009 10th International Conference on Document Analysis and Recognition  
The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation while the existence of complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumption as to the size of the characters, their orientation or colour, while it is tolerant to
more » ... it is tolerant to characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: the Posters from the Spanish Civil War. 10th International Conference on Document Analysis and Recognition 978-0-7695-3725-2/09 $25.00
doi:10.1109/icdar.2009.32 dblp:conf/icdar/ClavelliK09 fatcat:m6d2mnvntveb3fkh6zsaj7zdja