• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology


Indian Journal of Science and Technology

Year: 2015, Volume: 8, Issue: 27, Pages: 1-7

Original Article

Developing a Preliminary Scheme of Chunking for Kashmiri Corpora using Closed-class Words and Morphological Features


For Natural Language Processing, annotation is indispensable. Annotation of a corpus can be carried out at many levels - part of speech level, phrase or clausal level, dependency level, etc. The present paper is an attempt towards annotating Kashmiri corpora at the chunk level as no work has been carried out in this area. Chunk may be a phrase, larger or smaller, corresponding to actual phrases. Chunking is the process of dividing strings into groups of correlated tokens by a computer program. The present paper attempts to propose a preliminary idea of a chunker for Kashmiri corpus using closed-class words.
Keywords: Annotation, Chunking, Closed-Class Words, Natural Language Processing


Subscribe now for latest articles and news.