Open Vocabulary Arabic Handwriting Recognition Using Morphological Decomposition

Mahdi Hamdani, Amr El-Desoky Mousa, Hermann Ney
2013 2013 12th International Conference on Document Analysis and Recognition  
The use of Language Models (LMs) is a very important component in large and open vocabulary recognition systems. This paper presents an open-vocabulary approach for Arabic handwriting recognition. The proposed approach makes use of Arabic word decomposition based on morphological analysis. The vocabulary is a combination of words and subwords obtained by the decomposition process. Out Of Vocabulary (OOV) words can be recognized by combining different elements from the lexicon. The recognition
more » ... stem is based on Hidden Markov Models (HMMs) with position and context dependent character models. An n-gram LM trained on the decomposed text is used along with the HMMs during the search. The approach is evaluated using two Arabic handwriting datasets. The open vocabulary approach leads to a significant improvement in the system performance. Two different types experiments for two Arabic handwriting recognition tasks are conducted in this work. The proposed approach for open vocabulary allows to have an absolute improvement of up to 1% in the Word Error Rate (WER) for the constrained task and to keep the same performance of the baseline system for the unconstrained one.
doi:10.1109/icdar.2013.63 dblp:conf/icdar/HamdaniMN13 fatcat:zwac2g4okfedpebwfmmg2kmz5i