Indian Journal of Science and Technology
Year: 2016, Volume: 9, Issue: 4, Pages: 1-9
Sivakumar Venkataraman1*, Subitha Sivakumar1 and Rajalakshmi Selvaraj2
1Faculty of Computing, Botho University, Botswana and Department of Information Systems, BIUST, Botswana; [email protected], [email protected] 2Department of Information Systems, BIUST, Botswana; [email protected]
*Author For Correspondence
Sivakumar Venkataraman Faculty of Computing, Botho University, Botswana and Department of Information Systems, BIUST, Botswana; [email protected]
Background/Objectives: A novel feature selection framework using the minimum variance method is proposed. The purpose of the proposed method is to reduce computational complexity, reduce the number of initial features, and increase the classification accuracy of the selected feature subsets. Methods/Statistical Analysis: Clusters are formed using the minimum variance method. The process is repeated for different pairs of records, and voting is performed over the resulting sets of cluster pairs. The cluster pair with the maximum number of votes is chosen. From each cluster, the highest-priority member is selected using information gain and the remaining attributes are removed, thus reducing the number of attributes. Findings: The proposed feature selector is evaluated by comparing it with existing feature selection algorithms over 9 datasets from the UCI and WebKb datasets. The proposed method shows better results than most existing algorithms in terms of number of selected features, classification accuracy, and running time. Improvements/Applications: A new feature selector using the minimum variance method is implemented and is found to perform better than popular, computationally expensive traditional algorithms.
Keywords: Classification, Data Mining, Dimensionality Reduction, Feature Selection, Information Gain, Minimum Variance Method
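The final selection step described in the abstract, keeping the member with the highest information gain from each cluster and discarding the rest, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dictionary layout of the features, the toy data, and the assumption that features are discrete are all hypothetical. The clusters are taken as given here; forming them with the minimum variance method and the voting procedure is not reproduced.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their conditional entropy given a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_representatives(features, labels, clusters):
    """Keep the feature with the highest information gain from each cluster.

    features: dict mapping feature name -> list of discrete values (assumed layout)
    clusters: list of lists of feature names, assumed to come from a prior clustering step
    """
    return [
        max(cluster, key=lambda name: information_gain(features[name], labels))
        for cluster in clusters
    ]


# Toy data, invented purely for illustration.
labels = ["yes", "yes", "no", "no"]
features = {
    "f1": ["a", "a", "b", "b"],  # separates the classes perfectly
    "f2": ["a", "b", "a", "b"],  # carries no information about the class
    "f3": ["x", "x", "x", "y"],  # partially informative
    "f4": ["p", "p", "p", "p"],  # constant, no information
}
clusters = [["f1", "f2"], ["f3", "f4"]]  # hypothetical clustering result

selected = select_representatives(features, labels, clusters)
print(selected)  # ['f1', 'f3']
```

In this toy case four attributes are reduced to two, one per cluster, which mirrors the attribute reduction the abstract describes.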