Indian Journal of Science and Technology
Year: 2016, Volume: 9, Issue: 17, Pages: 1-8
Nguyen Van-Tu1 , Le Anh-Cuong2*
*Author of Corresponding: Le Anh-Cuong Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam; [email protected]
Question classification is the task of predicting the entity type of the answering sentence for a given question in natural language. It plays an important role in finding or constructing accurate answers and therefore helps to improve quality of automated question answering systems. Different lexical, syntactical and semantic features was extracted automatically from a question to serve the classification in previous studies. However, combining all those features doesn’t always give the best results for all types of questions. Different from previous studies, this paper focuses on the problem of how to extract and select efficient features adapting to each different types of question. We first propose a method of using a feature selection algorithm to determine appropriate features corresponding to different question types. Secondly, we design a new type of features, which is based on question patterns. We tested our proposed approach on the benchmark dataset TREC and using Support Vector Machines (SVM) for the classification algorithm. The experiment shows obtained results with the accuracies of 95.2% and 91.6% for coarse grain and fine grain data sets respectively, which are much better in comparison with the previous studies.
Keywords: Feature Extraction, Feature Selection, Question Answering Systems, Question Classification, Question Patterns
Subscribe now for latest articles and news.