Punjabi Stemmer Using Punjabi WordNet Database

Rajeev Puri, R. P. S. Bedi and Vishal Goyal

doi:10.17485/ijst/2015/v8i27/82943

Indian Journal of Science and Technology

Year: 2015, Volume: 8, Issue: 27, Pages: 1-5

Original Article

Rajeev Puri*, R. P. S. Bedi and Vishal Goyal

Punjab Technical University, Kapurthala Road, Jalandhar - 144601, Punjab, India

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Stemming is used as a pre-processing phase in information retrieval tasks. It produces linguistically normalized text, which helps improve retrieval results. This paper discusses a revised suffix removal approach, with an extended set of stripping rules, for building a stemming tool for the Punjabi language. The stemming algorithm uses regular expressions to find suffix matches, and the WordNet database is used to improve the stemming results.
Keywords: Brute Force, Rule Based Stemming, Punjabi Stemmer, Suffix Stripping, WordNet
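
The approach summarized in the abstract (regular-expression suffix matching combined with a dictionary lookup) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the three suffix rules, the tiny `LEXICON` set standing in for the Punjabi WordNet database, the function name `stem`, and the particular way the lexicon is consulted (known words are returned unchanged, and a stripped candidate found in the lexicon is preferred).

```python
import re

# Hypothetical suffix-stripping rules: (compiled pattern, replacement).
# Longer suffixes come first so they are tried before shorter ones.
# These are illustrative only, not the paper's actual rule set.
SUFFIX_RULES = [
    (re.compile("ੀਆਂ$"), "ੀ"),  # e.g. kuriyan (girls) -> kuri (girl)
    (re.compile("ਆਂ$"), ""),
    (re.compile("ਾਂ$"), ""),    # e.g. kitaban (books) -> kitab (book)
]

# Hypothetical stand-in for a WordNet lookup: a small set of known root words.
LEXICON = {"ਕੁੜੀ", "ਕਿਤਾਬ", "ਮਾਂ"}


def stem(word, lexicon=LEXICON):
    """Return a stem for `word` using suffix rules checked against a lexicon."""
    # A word that is already a known root is left untouched. This guards
    # against over-stemming roots that merely end in a suffix-like sequence.
    if word in lexicon:
        return word

    fallback = None
    for pattern, replacement in SUFFIX_RULES:
        candidate, count = pattern.subn(replacement, word)
        if count == 0:
            continue  # this rule's suffix does not occur at the end of the word
        if candidate in lexicon:
            return candidate  # stripped form confirmed by the lexicon
        if fallback is None and candidate:
            fallback = candidate  # remember the first unconfirmed candidate

    # No confirmed stem: use the first rule-based candidate, else the word itself.
    return fallback if fallback is not None else word


if __name__ == "__main__":
    for w in ["ਕੁੜੀਆਂ", "ਕਿਤਾਬਾਂ", "ਮਾਂ", "ਘਰ"]:
        print(w, "->", stem(w))
```

In this sketch the lexicon check plays two roles: it keeps a root such as ਮਾਂ (mother) from losing its final ਾਂ, and it selects among candidate stems when more than one rule matches. Whether the paper's tool uses WordNet in the same way is not stated in the abstract.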