Notice: Undefined offset: 1 in /var/www/indjst.org/article-detail-page.php on line 103
Correlation Similarity Measure based Document Clustering with Directed Ridge Regression
 
  • P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology

Article

Indian Journal of Science and Technology

Year: 2014, Volume: 7, Issue: 5, Pages: 692–697

Original Article

Correlation Similarity Measure based Document Clustering with Directed Ridge Regression

Abstract

Correlation Preserving Indexing (CPI) can discover the intrinsic structures implanted in high-dimensional document space. To predict the result of one variable based on another variable is not suitable for all the situations since two variable prediction problems takes places. In this paper, Directed Ridge Regression is introduced to predict two or more variables which are highly correlated in high dimensional document space. Directed Ridge Regression is a statistical technique to estimate the relationship among the variables based on the Eigen values to find the similarity between the documents. The directed ridge estimator alters the diagonal elements of the Eigen values. The objective of the Directed Ridge Regression is to achieve efficient document clustering in similarity measure. Experimental results shows that compared to Correlation Preserving Indexing, the Directed Ridge Regression achieves efficient document clustering.

Keywords: Correlation Similarity Measure, Directed Ridge Regression, Document Clustering, Latent Semantic Indexing 

DON'T MISS OUT!

Subscribe now for latest articles and news.