Indian Journal of Science and Technology
Year: 2015, Volume: 8, Issue: 27, Pages: 1-7
Aadil Amin Kak, Shahid Yousuf Gilkar* and Nahida Ali
Department of Linguistics, University of Kashmir, Srinagar - 190006, J and K, India; [email protected]
For Natural Language Processing, annotation is indispensable. Annotation of a corpus can be carried out at many levels - part of speech level, phrase or clausal level, dependency level, etc. The present paper is an attempt towards annotating Kashmiri corpora at the chunk level as no work has been carried out in this area. Chunk may be a phrase, larger or smaller, corresponding to actual phrases. Chunking is the process of dividing strings into groups of correlated tokens by a computer program. The present paper attempts to propose a preliminary idea of a chunker for Kashmiri corpus using closed-class words.
Keywords: Annotation, Chunking, Closed-Class Words, Natural Language Processing
Subscribe now for latest articles and news.