Indian Journal of Science and Technology
DOI: 10.17485/ijst/2016/v9i47/107360
Year: 2016, Volume: 9, Issue: 47, Pages: 1-7
Original Article
N. P. Nethravathi1*, Prasanth G. Rao1 ,Chaitra C. Vaidya2 , S. Geethanjali2 , P. Madhura2 , K. Neha Nandan2 , P. Deepa Shenoy2 , M. Indiramma3 and K. R. Venugopal2
1 Visvesvaraya Technological University, Belagavi - 590 018, Karnataka, India; [email protected] 2 University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India 3 BMS College of Engineering, Bangalore - 560019, Karnataka, India
*Author for correspondence
N. P. Nethravathi
Visvesvaraya Technological University, Belagavi - 590 018, Karnataka, India; [email protected]
Objectives: Preservation of privacy is a significant aspect of data mining. The main objective of PPDM is to hide or provide privacy to certain sensitive information so that they can be protected from unauthorized parties or intruders. Methods/ Statistical Analysis: Though privacy is achieved by hiding the sensitive or private data, it will affect the data mining algorithms in knowledge extraction, so an effective method or strategy is required to provide privacy to the data and simultaneously protecting the quality of data mining algorithms. Instead of removing or encrypting sensitive or private data, we make use of data transformation strategies that keep the statistical, semantic and heuristic nature of data while protecting the sensitive or private data. Findings: In this paper we studied the technical feasibility of realizing Privacy Preserving Data Mining. In the proposed work, Correlation Based Transformation Strategy for Privacy Preserving Data Mining is used for ordinal data. We apply the method on few datasets namely soybean, Breast Cancer, Nursery dataset and Car dataset. We tabulate the end results applying the proposed strategy on both the original and the transformed dataset and observe correlation difference, Information Entropy and Classification Accuracy with different machine learning algorithms and Clustering Quality. Application/Improvements: As an improvement, the proposed work can be extended by use of vector marking techniques where these techniques help in increasing the efficiency by avoiding unauthorised access to the information.
Keywords: Correlation Analysis, Nominal Data, Ordinal Data, Privacy Preserving Data Mining, Transformation Strategy
Subscribe now for latest articles and news.