Indian Journal of Science and Technology
Year: 2016, Volume: 9, Issue: 45, Pages: 1-4
Parul Kalra1*, Navjot Kaur Walia1 , Deepti Mehrotra1 and Abdul Wahid2
1Amity School of Engineering and Technology, Amity University, Amity Campus Sector –125, Noida –201303, Uttar Pradesh, India; [email protected], [email protected], [email protected] 2School of Computer Science and Information Technology, University of Hyderabad, Central University P.O., Prof. C.R.Rao Road, Gachibowli, Hyderabad–500046, India; [email protected]
*Author for correspondence
Parul Kalra Amity School of Engineering and Technology, Amity University, Amity Campus Sector –125, Noida –201303, Uttar Pradesh, India; [email protected]
In recent years, with the rapid increase in Internet usage, the data that has been generated is huge and unstructured. These data can be interpreted with various techniques of Data Mining. Many useful patterns can be extracted from these trends. Classifying these data into meaningful analysis is the key concept behind this study. In this paper, the authorship data for books was used. A data was created where various attributes of users were stored along with the book that they like to read. Naive Bayes was applied on the data set to find which factor is majorly affecting the ratings of the books. The various attributes were compared using data mining tool and found that the rating of books highly depends upon the location of the user. This interpretation was also verified by the measure of precision and recall. High precision results into more accuracy of the system.
Keywords: Authorship Data, Information Retrieval, Naïve Bayes, Precision, Recall
Subscribe now for latest articles and news.