News Classification: A Data Mining Approach

Dipak Ramchandra Kawade and Kavita S. Oza

doi:10.17485/ijst/2016/v9i46/84444

Indian Journal of Science and Technology

Year: 2016, Volume: 9, Issue: 46, Pages: 1-6

Original Article

Dipak Ramchandra Kawade¹* and Kavita S. Oza²

¹Department of Computer Science, Sangola College, Sangola – 413307, Maharashtra, India; [email protected]
²Department of Computer Science, Shivaji University, Kolhapur – 416004, Maharashtra, India; skavita.oza@gmail.com

*Author for correspondence
Dipak Ramchandra Kawade
Department of Computer Science, Sangola College, Sangola – 413307, Maharashtra, India; [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: Text classification is an important application of data mining. It assigns text documents to predefined class labels on the basis of words, phrases, and word combinations. Method/Analysis: The present study classifies news data into four predefined classes: Business, Entertainment, Sports, and Technology. WEKA, an open-source data mining tool, is used for the classification. Several classification algorithms are applied to the news data set and compared on accuracy, time, error, and ROC to identify the best algorithm for classifying this data set. Findings: Based on accuracy, time, error, and the ROC curve, the study concludes that the Naïve Bayes Multinomial algorithm performs best for news classification.

Keywords: Classification Algorithms, Data Mining, Text Classification, WEKA
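
To illustrate the kind of word-based classification the abstract describes, the following is a minimal sketch of a multinomial Naïve Bayes classifier over the four news classes. It is not the WEKA implementation used in the study and does not use the study's data set: it is written in plain Python, the function names `train_multinomial_nb` and `predict` are hypothetical, the headlines are invented for illustration, and add-one (Laplace) smoothing is an assumption made here.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, labels):
    """Collect per-class document counts and word counts from tokenized docs."""
    class_docs = Counter(labels)          # number of documents per class
    word_counts = defaultdict(Counter)    # word frequencies per class
    vocab = set()
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_docs, word_counts, vocab


def predict(model, tokens):
    """Return the class with the highest log posterior (add-one smoothing)."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_docs):
        total_words = sum(word_counts[label].values())
        score = math.log(class_docs[label] / total_docs)  # log prior
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore words never seen during training
            count = word_counts[label][tok]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy headlines, invented for illustration only.
TRAIN = [
    ("stock market shares profit bank", "Business"),
    ("bank merger profit investors", "Business"),
    ("film actor music award premiere", "Entertainment"),
    ("actor film festival music", "Entertainment"),
    ("match team goal cricket win", "Sports"),
    ("team win cricket tournament", "Sports"),
    ("software phone chip internet launch", "Technology"),
    ("internet software startup chip", "Technology"),
]

docs = [text.split() for text, _ in TRAIN]
labels = [label for _, label in TRAIN]
model = train_multinomial_nb(docs, labels)

print(predict(model, "cricket team win".split()))    # → Sports
print(predict(model, "new phone software".split()))  # → Technology
```

A comparison like the one in the study would repeat this train-and-predict step for each candidate algorithm on held-out news items and record accuracy, time, error, and ROC for each.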