Comparison of Different Attributes of Authorship Data using Data Mining Approach

Parul Kalra; Navjot Kaur Walia; Deepti Mehrotra  and Abdul Wahid

doi:10.17485/ijst/2016/v9i45/106368

Article

Comparison of Different Attributes of Authorship Data using Data Mining Approach

VIEWS 961
PDF 813

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2016/v9i45/106368

Year: 2016, Volume: 9, Issue: 45, Pages: 1-4

Original Article

Comparison of Different Attributes of Authorship Data using Data Mining Approach

Parul Kalra^1*, Navjot Kaur Walia¹ , Deepti Mehrotra¹ and Abdul Wahid²

¹Amity School of Engineering and Technology, Amity University, Amity Campus Sector –125, Noida –201303, Uttar Pradesh, India; [email protected], [email protected], [email protected] ²School of Computer Science and Information Technology, University of Hyderabad, Central University P.O., Prof. C.R.Rao Road, Gachibowli, Hyderabad–500046, India; [email protected]

*Author for correspondence
Parul Kalra Amity School of Engineering and Technology, Amity University, Amity Campus Sector –125, Noida –201303, Uttar Pradesh, India; [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

In recent years, with the rapid increase in Internet usage, the data that has been generated is huge and unstructured. These data can be interpreted with various techniques of Data Mining. Many useful patterns can be extracted from these trends. Classifying these data into meaningful analysis is the key concept behind this study. In this paper, the authorship data for books was used. A data was created where various attributes of users were stored along with the book that they like to read. Naive Bayes was applied on the data set to find which factor is majorly affecting the ratings of the books. The various attributes were compared using data mining tool and found that the rating of books highly depends upon the location of the user. This interpretation was also verified by the measure of precision and recall. High precision results into more accuracy of the system.

Keywords: Authorship Data, Information Retrieval, Naïve Bayes, Precision, Recall