Indian Journal of Science and Technology
DOI: 10.17485/ijst/2015/v8i27/83917
Year: 2015, Volume: 8, Issue: 27, Pages: 1-12
Original Article
Kawarbir Singh Dhanju1* , Gurpreet Singh Lehal 1 , Tejinder Singh Saini 2 and Arshdeep Kaur1
1 DCS, Punjabi University, Patiala - 147 002, Punjab, India; [email protected], [email protected], [email protected]
2 ACTDPL, Punjabi University, Patiala - 147 002, Punjab, India [email protected]
Spellchecker is a software tool that identifies and corrects any spelling mistakes in a text document. Designing a spell checker for Punjabi language is a challenging task. Punjabi language can be written in two scripts, Gurmukhi script (a Left to Right script based on Devanagari) and Perso-Arabic Script (a Right to Left script) which is also referred as Shahmukhi. Gurmukhi script follow ‘one sound - one symbol’ principle where as Shahmukhi follows ‘one sound - multiple symbol’ principle. Thus making Shahmukhi text even more challenging which complicates the design of spell checker for Shahmukhi text. The text written in Shahmukhi normally does not have short vowels and diacritic marks. So missing some of diacritic marks should not be considered as a mistake. But for Holy books like Quran, missing diacritic marks are considered as a mistake. So spell checker is designed in such a way that it can spell check with and without diacritic marks compulsion, which depends on user’s selection to spell check. In addition to this, Shahmukhi text has complex grammatical rules and phonetic properties. Thus it needs different algorithms and techniques for expected efficiency. This paper presents the complete design and implementation of a spell checker for Shahmukhi text.
Keywords: Edit Distance, Gurmukhi, Punjabi, Shahmukhi, Spellchecker, Typing Errors
Subscribe now for latest articles and news.