Improvement of building field association term dictionary using passage retrieval
Authors:Uddin Md  Elmarhomy  Elsayed  Masao  Kazuhiro  Jun-ichi
Institution:Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima 770-8506, Japan
Abstract:Field Association (FA) terms are a limited set of discriminating terms that can specify document fields. A document's field can be determined efficiently if the document contains many relevant FA terms. An earlier approach built an FA term dictionary using a WWW search engine, but that dictionary contained irrelevant FA terms because the approach extracted terms from whole documents. This paper proposes a new approach that extracts FA terms from passages (portions of a document's text) rather than from whole documents, and that extracts FA terms more accurately than the earlier approach. The proposed approach is evaluated on 38,372 articles from a large tagged corpus. According to the experimental results, the new approach appends about 24% more relevant FA terms to the earlier FA term dictionary and deletes around 32% of its irrelevant FA terms. Moreover, the new approach achieves a precision of 98% and a recall of 94%.
Keywords:Field association terms  Passage retrieval  WWW search engine  FA terms dictionary  Recall  Precision
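The abstract reports precision and recall for the extracted FA terms. As a minimal sketch of how these two measures are conventionally computed for a set of extracted terms against a reference set of relevant terms, consider the following. The function name, the example terms, and the example field are illustrative assumptions and do not come from the paper; the paper's own evaluation procedure may differ.

```python
def precision_recall(extracted, relevant):
    """Return (precision, recall) for a set of extracted terms.

    precision = |extracted AND relevant| / |extracted|
    recall    = |extracted AND relevant| / |relevant|
    """
    extracted, relevant = set(extracted), set(relevant)
    true_positives = len(extracted & relevant)
    # Guard against empty sets to avoid division by zero.
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: candidate FA terms for a "baseball" field.
extracted_terms = {"home run", "pitcher", "inning", "election"}
relevant_terms = {"home run", "pitcher", "inning", "shortstop", "bullpen"}

precision, recall = precision_recall(extracted_terms, relevant_terms)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case three of the four extracted terms are relevant, and three of the five relevant terms were found, so an irrelevant term such as "election" lowers precision while the two missed terms lower recall.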
This article is indexed in ScienceDirect and other databases.