Indian Journal of Science and Technology
Year: 2013, Volume: 6, Issue: 4, Pages: 1-4
R. C. Balabantaray1*, S. K. Lenka2 and D. Sahoo3
1 Asst Prof., CLIA Lab, Department of Computer Science, [email protected]
2 Project Fellow, CLIA Lab, IIIT, Bhubaneswar, san[email protected]
3 Research Project Fellow, CLIA Lab, IIIT, [email protected]
*Author For Correspondence
R. C. Balabantaray
Department of Computer Science,
Name Entity Recognition (NER) is a process of information extraction that seeks to locate atomic elements in text and classify them into predefined categories such as the name of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc. In this paper, we present an Odia Named Entity Recognizer which will be very useful in searching of information about not only tourism but also all general domains. Here, we follow the method of conditional random field. This is a machine learning technique based on linguistic rules of Name Entities (NE). It handles nested tagging of name entities with a hierarchical tag set containing forty four attributes (level one), thirty eight attributes (level two) and thirty five attributes (level three) in tag set. We have experimented building Conditional Random Field (CRF) models by training the noun phrases of the training data and it gives encouraging results.
Keywords: Odia Tourism Corpus, Name Entity Recognition, Precision, Recall, F-measure.
Subscribe now for latest articles and news.