Online Handwritten Gurmukhi Character Recognition Using Elastic Matching

Anuj Sharma, Rajesh Kumar, R. K. Sharma
2008 Congress on Image and Signal Processing
Computers greatly influence the lives of human beings, and their usage is increasing at a tremendous rate. The ease with which information can be exchanged between user and computer is of immense importance today, because input devices such as the keyboard and mouse have limitations compared with input through natural handwriting. Online handwriting recognition offers a quick and natural way of communication between computers and human beings. Handwriting recognition has been in
... for over four decades and has attracted many researchers across the world. Variation in handwriting is one prominent problem, and achieving a high degree of accuracy is a tedious task. The main goal of this thesis is to develop an online handwritten Gurmukhi character recognition system. Gurmukhi is the script of the Punjabi language, which is widely spoken across the globe. This thesis is divided into six chapters. A brief outline of each chapter is given in the following paragraphs.

Chapter 1 includes three sections, namely, issues in online handwriting recognition systems, literature review, and overview of Gurmukhi script. Issues in online handwriting recognition systems include: variations in handwriting style; constrained and unconstrained handwriting; personal, situational and material factors; and writer-dependent vs. writer-independent recognition systems. The literature review presents a detailed survey of each phase of the established procedure of online handwriting recognition, which comprises data collection, preprocessing, feature extraction, segmentation, recognition and post-processing. We have also reviewed the literature on different recognition methods: statistical, syntactical and structural, neural network, and elastic matching methods. In addition, we have discussed some of the results reported in the online handwriting recognition literature. This literature review covers different languages such as English, Chinese, Japanese, Urdu, Hindi, Bengali, Tamil and Telugu. The overview of Gurmukhi script covers the nature of handwriting in Gurmukhi script and the different characters of the script.

Chapter 2 contains the work carried out for three phases of online handwritten character recognition. These phases are data collection, preprocessing and feature extraction.
These phases are discussed in three sections entitled data collection phase, preprocessing phase and computation of features phase. In the data collection phase, input handwritten strokes are collected. We have discussed the procedure to collect the data at stroke level. The preprocessing phase follows the data collection phase. In the preprocessing phase, we have considered size normalization and centering of the stroke, interpolation of missing points, smoothing, slant correction and resampling of points in the stroke. We have proposed algorithms for the respective stages. In the computation of features phase, features are computed after preprocessing of the input handwritten stroke. The high level features are computed on the basis of low level features. The high level features include loop, crossings, straight line, headline and dots. The common low level features are position of stroke, area, length, curliness, slope, etc. We have introduced algorithms to recognize these high level features. We have noted an improvement of 5%, 3.33%, 6.66% and 8.34% in recognition of loop, headline, straight line and dot features, respectively, after using the preprocessing stage.

Chapter 3 focuses on recognition of online handwritten Gurmukhi characters using the elastic matching method. This chapter also illustrates the use of the post-processing stage. In this chapter, we have presented a process to recognize online handwritten Gurmukhi characters which uses forty unique dependent strokes for 41 Gurmukhi characters. These dependent strokes are assigned unique stroke ids. This process recognizes a Gurmukhi character in two stages. In the first stage a stroke id is recognized, and in the second stage the character is recognized on the basis of the recognized stroke ids. For this process, two databases, namely, a stroke database and a character database, have been prepared. Strokes are recognized using the stroke database and characters are recognized using the character database.
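Two of the preprocessing steps named above, size normalization with centering and resampling of points, can be illustrated with a minimal sketch. This is not the thesis's algorithm: the function names `normalize_and_center` and `resample`, the bounding-box centering, and the equal arc-length spacing are all assumptions chosen for illustration. Interpolation of missing points, smoothing and slant correction are omitted.

```python
import math

def normalize_and_center(stroke, size=1.0):
    """Scale a stroke to fit a size x size box (aspect ratio kept)
    and move the center of its bounding box to the origin.
    Assumption: bounding-box centering; the thesis may do it differently."""
    xs = [p[0] for p in stroke]
    ys = [p[1] for p in stroke]
    extent = max(max(xs) - min(xs), max(ys) - min(ys))
    scale = size / extent if extent > 0 else 1.0
    cx = (max(xs) + min(xs)) / 2.0
    cy = (max(ys) + min(ys)) / 2.0
    return [((x - cx) * scale, (y - cy) * scale) for x, y in stroke]

def resample(stroke, n_points):
    """Return n_points spaced equally along the stroke's arc length,
    using linear interpolation between the original pen samples."""
    if len(stroke) < 2 or n_points < 2:
        return list(stroke)
    # Cumulative arc length at each original point.
    cum = [0.0]
    for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
        cum.append(cum[-1] + math.hypot(x1 - x0, y1 - y0))
    total = cum[-1]
    if total == 0.0:
        return [stroke[0]] * n_points
    out = []
    seg = 0
    for i in range(n_points):
        target = total * i / (n_points - 1)
        # Advance to the segment that contains the target length.
        while seg < len(stroke) - 2 and cum[seg + 1] < target:
            seg += 1
        seg_len = cum[seg + 1] - cum[seg]
        t = (target - cum[seg]) / seg_len if seg_len > 0 else 0.0
        x = stroke[seg][0] + t * (stroke[seg + 1][0] - stroke[seg][0])
        y = stroke[seg][1] + t * (stroke[seg + 1][1] - stroke[seg][1])
        out.append((x, y))
    return out

# Hypothetical pen samples for one stroke.
raw = [(0.0, 0.0), (10.0, 0.0), (10.0, 5.0)]
prepared = resample(normalize_and_center(raw), 8)
```

Resampling to a fixed number of points gives every stroke the same length, which simplifies any later point-by-point comparison.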
We have used the elastic matching method as the recognition method in this chapter. The post-processing phase is applied after the recognition method. The recognition rate achieved without post-processing steps is 87.40%, whereas it is 90.08% when post-processing steps are included. As such, we could achieve an improvement of 2.68 percentage points in recognition of Gurmukhi characters when post-processing steps are in place. It has been noted that 24 characters have shown improvement in their recognition rate after using post-processing steps. A maximum of 6.67% improvement has been found in some of the characters after using post-processing steps.

In Chapter 4, we have recognized online handwritten Gurmukhi characters using two methods, namely, small line segments and hidden Markov model. We have proposed a new recognition method based on elastic matching and chain code techniques. This method has been called the small line segments method. The proposed method includes a [...]

List of figures (fragment): 5.12 Recognition of the input handwritten word shown in Fig. 5.9. 5.13 Stability of the online handwritten Gurmukhi words recognizer for the first 10, 20, 30, 40 and 50 writers. 5.14 Number of writers who achieved a recognition rate (%) between 70-75, 75-80, 80-85, 85-90, 90-95 and 95-100.
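The two-stage process described in the abstract (a stroke id is recognized first, then a character from the recognized stroke ids) can be sketched as follows. The abstract does not give the matching cost, so this sketch assumes dynamic time warping as one common form of elastic matching. The function names, the toy templates and the labels `char_A` and `char_B` are hypothetical and do not come from the thesis.

```python
import math

def elastic_distance(a, b):
    """Dynamic time warping cost between two point sequences.
    Assumption: DTW with Euclidean point distance stands in for
    the elastic matching method named in the abstract."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.hypot(a[i - 1][0] - b[j - 1][0], a[i - 1][1] - b[j - 1][1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

def recognize_stroke(stroke, stroke_db):
    """Stage 1: return the stroke id whose closest template has the lowest cost."""
    return min(
        stroke_db,
        key=lambda sid: min(elastic_distance(stroke, t) for t in stroke_db[sid]),
    )

def recognize_character(strokes, stroke_db, char_db):
    """Stage 2: look up the character for the sequence of recognized stroke ids.
    Returns None when the id sequence is not in the character database."""
    ids = tuple(recognize_stroke(s, stroke_db) for s in strokes)
    return char_db.get(ids)

# Toy databases (hypothetical): stroke id -> templates, id sequence -> label.
stroke_db = {
    1: [[(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]],  # horizontal line
    2: [[(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)]],  # vertical line
}
char_db = {(1,): "char_A", (1, 2): "char_B"}

written = [
    [(0.0, 0.1), (0.9, 0.0), (2.0, 0.1)],  # roughly horizontal
    [(0.1, 0.0), (0.0, 1.1), (0.1, 2.0)],  # roughly vertical
]
label = recognize_character(written, stroke_db, char_db)
```

Keeping the stroke database and the character database separate, as the abstract describes, lets several characters share the same stroke templates.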
doi:10.1109/cisp.2008.297 fatcat:q4zl5vgttraatin3rv4mq7fs7a