A Hybrid K-Mean Clustering Algorithm for Prediction Analysis

Vikas Verma; Shaweta Bhardwaj and Harjit Singh

doi:10.17485/ijst/2016/v9i28/98392

Article

A Hybrid K-Mean Clustering Algorithm for Prediction Analysis

VIEWS 1090
PDF 429

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2016/v9i28/98392

Year: 2016, Volume: 9, Issue: 28, Pages: 1-5

Original Article

A Hybrid K-Mean Clustering Algorithm for Prediction Analysis

Vikas Verma^*, Shaweta Bhardwaj and Harjit Singh

School of Computer Science and Engineering, [email protected]
[email protected]
[email protected]
*Author for correspondence
Vikas Verma
School of Computer Science and Engineering,
Email: [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Background/Objective: The objective of this research is to make improvement in defining the clusters automatically and to assign required clusters to un-clustered points. Methods/Statistical Analysis: The main disadvantage of k-mean is of accuracy, as in k-mean clustering user needs to define number of clusters during the start of process. This restriction of predefined number of clusters leads to some points of the dataset remained un-clustered. So by enhancing the cluster technique, the predictions can be improved. We use Iris dataset for the current study and to generate the results using normalization in the methodology which will lead to improvement in accuracy and will reduce clustering time by the member assigned to the cluster. Findings: The normalization is used to get better results in the form of finding distance to have exact centroid and to remove noise data which is not needed. We are applying backtracking method to find the exact number of clusters that should be defined to analyze the data in better way. The results shows that there is an improvement in clustering when compared to the existing methodologies.
Keywords: K-mean Clustering, Prediction Analysis, Data Mining, Classification, Clustering, Hybrid Clustering,