首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Analysis of Statistical Question Classification for Fact-Based Questions
Authors:Donald Metzler  W Bruce Croft
Institution:(1) University of Massachusetts, Amherst
Abstract:Question classification systems play an important role in question answering systems and can be used in a wide range of other domains. The goal of question classification is to accurately assign labels to questions based on expected answer type. Most approaches in the past have relied on matching questions against hand-crafted rules. However, rules require laborious effort to create and often suffer from being too specific. Statistical question classification methods overcome these issues by employing machine learning techniques. We empirically show that a statistical approach is robust and achieves good performance on three diverse data sets with little or no hand tuning. Furthermore, we examine the role different syntactic and semantic features have on performance. We find that semantic features tend to increase performance more than purely syntactic features. Finally, we analyze common causes of misclassification error and provide insight into ways they may be overcome.
Keywords:question classification  question answering  machine learning  Support Vector Machines  syntactic features  semantic features  WordNet
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号