Indian Journal of Science and Technology
DOI: 10.17485/ijst/2019/v12i46/147512
Year: 2019, Volume: 12, Issue: 46, Pages: 1-6
Original Article
Gehad Zakria1,*, Mamdouh Farouk2, Khaled Fathy2 and Malak N. Makar1
1 Department of Mathematics, Faculty of Science, Assiut University, Egypt
2 Department of Computer Science, Faculty of Computers and Information, Assiut University, Egypt
Objectives/Methods: This study aims to extract relations between entities from Arabic text. Relation extraction is one of the most important tasks in text mining and a main step in many applications, such as extracting triples from text, question answering, and ontology building. However, extracting relations from Arabic text is more difficult than from English text because annotated Arabic corpora are scarce. This paper proposes a method for extracting relations from Arabic text based on the characteristics of Arabic Wikipedia articles. The proposed system extracts sentences that contain a principal entity, a secondary entity, and a relation from a Wikipedia article, then uses WordNet and DBpedia to build the training set. Finally, a Naive Bayes classifier is trained and tested on the resulting datasets. Findings: Few works address relation extraction from Arabic text; those that exist rely on classification, clustering, or rule-based methods. Application/Improvement: The experiments show the effectiveness of the proposed approach, which achieves a high precision of 89% in classifying 19 types of semantic relations.
Keywords: Relation Extraction, Arabic Wikipedia, Semantic Relation, Arabic language.
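The final step named in the abstract, classifying a sentence into a relation type with Naive Bayes, can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words features, the Laplace smoothing, the English toy sentences, the PRINCIPAL/SECONDARY placeholder tokens, and the relation labels (birthPlace, almaMater, capitalOf) are all assumptions made for illustration, and a real Arabic pipeline would need proper normalization and tokenization.

```python
import math
from collections import Counter, defaultdict


def tokenize(sentence):
    # Assumption: plain whitespace tokenization on lowercased text.
    return sentence.lower().split()


class NaiveBayesRelationClassifier:
    """Multinomial Naive Bayes over bag-of-words features (illustrative only)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                        # Laplace smoothing constant
        self.label_counts = Counter()             # sentences seen per relation
        self.token_counts = defaultdict(Counter)  # token counts per relation
        self.vocab = set()

    def fit(self, sentences, labels):
        for sentence, label in zip(sentences, labels):
            self.label_counts[label] += 1
            for tok in tokenize(sentence):
                self.token_counts[label][tok] += 1
                self.vocab.add(tok)
        return self

    def predict(self, sentence):
        total = sum(self.label_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # log P(label) + sum of log P(token | label)
            score = math.log(count / total)
            denom = sum(self.token_counts[label].values()) + self.alpha * vocab_size
            for tok in tokenize(sentence):
                if tok not in self.vocab:
                    continue  # ignore tokens never seen in training
                score += math.log((self.token_counts[label][tok] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical training set: entity mentions are replaced by placeholder
# tokens, and each sentence carries a made-up relation label.
TRAIN = [
    ("PRINCIPAL was born in SECONDARY", "birthPlace"),
    ("PRINCIPAL was born and raised in SECONDARY", "birthPlace"),
    ("PRINCIPAL studied at SECONDARY", "almaMater"),
    ("PRINCIPAL graduated from SECONDARY", "almaMater"),
    ("PRINCIPAL is the capital of SECONDARY", "capitalOf"),
    ("PRINCIPAL serves as the capital city of SECONDARY", "capitalOf"),
]

classifier = NaiveBayesRelationClassifier().fit(
    [sentence for sentence, _ in TRAIN],
    [label for _, label in TRAIN],
)

if __name__ == "__main__":
    print(classifier.predict("PRINCIPAL was born near SECONDARY"))
```

Replacing entity mentions with placeholder tokens is one simple way to keep the classifier focused on the words that express the relation rather than on the entity names themselves; whether the paper does this is not stated in the abstract.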