Indian Journal of Science and Technology
Year: 2016, Volume: 9, Issue: 47, Pages: 1-5
A. Pandian*, V. V. Ramalingam and R. P. Vishnu Preet
Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India; [email protected]
Objectives: To training classifier based on the features extracted from the poems of Mukkoodar Pallu, authors for various unknown poems can be classified. Methods/Analysis: The classification accuracy by performing classification in the dataset using C4.5 algorithm is illustrated in this paper. Findings: The results of performing classification on dataset that consists of features extracted from the dataset are shown in this paper. Features like number of characters, number of sentences and the classification accuracy when C4.5 algorithm is used is illustrated. Novelty/Improvement: By doing this, authors of various other poems in Tamil language can be identified which will be helpful to the society. Also a generalized authorship identification tool for all regional languages can be achieved.
Keywords: Authorship, Classification, Feature Selection, Tamil Articles
Subscribe now for latest articles and news.