Indian Journal of Science and Technology
DOI: 10.17485/ijst/2014/v7i10.15
Year: 2014, Volume: 7, Issue: 10, Pages: 1643–1649
Original Article
Vahid Rafe and Morteza Nozari
1 Department of Computer Engineering, Malayer Branch, Islamic Azad university, Malayer, Iran; nozari_st@yahoo.com, v-rafe@araku.ac.ir
The study presents an algorithm for precise index-based multiple pattern matching which detects Quranic verses in a text and pinpoints them. To be sufficiently precise, Arabic diacritical symbols are removed from the input text, and then a unique algorithm changes the detected strings into indices and detects Quranic verses by focusing on indices consecutiveness. To accelerate the function, the stored strings in databanks were decreased from 84845 to 13362 strings; therefore, the search speed increased. The proposed Quranic algorithm is used for text analysis, and information retrieval criteria such as recall and precision and F criteria have been used to evaluate it. The results suggest that they had a profound impact on the efficiency of the algorithm.
Keywords: Indexing, Information Retrieval, Strings Matching, Text Analysis
Subscribe now for latest articles and news.