• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology


Indian Journal of Science and Technology

Year: 2016, Volume: 9, Issue: 11, Pages: 1-16

Original Article

Comparative Analysis of Data Mining Tools and Techniques for Information Retrieval


Background/Objectives: There are a lot of information retrieval techniques available for getting information from different kinds of sources. Main aim of this paper is to improve information retrieval activities to a higher level. Different methods for information retrieval have been studied and discussed. It involves use of Fuzzy Ontology Generation framework (FOGA) framework along with Formal Concept Analysis (FCA) based clustering and keyword matching approach. Hidden Markov Model has been used for retrieval of data from search engines in an intelligent and efficient way for correct identification and retrieval from the database. Classification algorithm has been used for detection of community and conversion of large community graph to sub community graph for its better study and usage. Findings: It has been found that fuzzy ontology generation framework can automatically generate the fuzzy ontology which is very hard and time consuming task otherwise. Clustering of data can be done using the technique of formal concept analysis along with keyword matching method. There is a large amount of data available under similar words but with different meanings. So there has been a lot of problem in retrieval of exact data as required in a very short amount of time. In this case Hidden Markov Model can be used which can find the non-observable or hidden stochastic process from the observable stochastic process. Application/ Improvements: Generalized Expectation-Maximization algorithm used with Hidden Markov Model can find unknown parameters. By adding frequency tracking algorithm along with Hidden Markov Model, we can also track audible data from a large database. Community detection algorithm along with Informap and Bigclam algorithms used with Hidden Markov Model will increase the modularity of data. Its applications include use of information retrieval of different types of data in an extremely faster and efficient way. 

Keywords: Classification, Clustering, Community Detection Algorithm, Fuzzy Ontology Generation Framework, Formal Concept Analysis, Hidden Markov Model


Subscribe now for latest articles and news.