Indian Journal of Science and Technology
DOI: 10.17485/IJST/v15i43.1964
Year: 2022, Volume: 15, Issue: 43, Pages: 2275-2281
Original Article
Nawaz Ali Lone1, Kaiser J Giri2*, Rumaan Bashir2
1Research Scholar, Department of Computer Science, Islamic University of Science & Technology, Kashmir
2Associate Professor, Department of Computer Science, Islamic University of Science & Technology, Kashmir
*Corresponding Author
Received Date: 30 September 2022, Accepted Date: 11 October 2022, Published Date: 16 November 2022
Objectives: The main objective of this paper, as a maiden attempt, is to identify the basic resources necessary for undertaking Natural Language Processing (NLP) research on the Kashmiri language. The paper also deliberates on key issues in NLP for Kashmiri, such as complex linguistic phenomena, the lack of standard linguistic tools and of documented, standardized resources, and the influence of dominant languages, mostly Urdu and English, on Kashmiri.
Methods: As no substantial work specific to NLP of the Kashmiri language is reported in the literature, a holistic research strategy was adopted to explore possible sources for creating the basic resources needed to undertake NLP research for Kashmiri.
Findings: After thorough investigation, it was observed that only some minor work related to Machine Translation of the Kashmiri language has been reported in the literature. Further, a few newspapers are published in Kashmiri, and these can be used as a means of creating a Kashmiri corpus. Moreover, crowdsourcing could be used as a potential means of developing digital linguistic resources for the Kashmiri language.
Novelty: The present study is a maiden attempt at identifying NLP resources for the Kashmiri language and will be of immense importance to the research community interested in developing the Kashmiri language in the digital domain.
Keywords: Natural Language Processing; Transliteration; Kashmiri Language; Scheduled Languages; Crowdsource; Tag Set; P-o-S Tagging
© 2022 Lone et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)