

Indexing strategies for Swedish full text retrieval under different user scenarios
Authors: Per Ahlgren; Jaana Kekäläinen
Institution: 1. University College of Borås, Swedish School of Library and Information Science, 501 90 Borås, Sweden; 2. University of Tampere, Department of Information Studies, Kanslerinrinne 1, 33014 Tampere, Finland
Abstract: This paper deals with Swedish full text retrieval and the problem of morphological variation of query terms in the document database. The effects of combinations of indexing strategies with query terms on retrieval effectiveness were studied. Three of the five tested combinations involved indexing strategies that used conflation, in the form of normalization. Two of these three combinations also used indexing strategies that employed compound splitting. Normalization and compound splitting were performed by SWETWOL, a morphological analyzer for the Swedish language. A fourth combination attempted to group related terms by right-hand truncation of query terms. These four combinations were compared to each other and to a baseline combination, in which no attempt was made to counteract the problem of morphological variation of query terms in the document database. The five combinations were evaluated under six different user scenarios, each simulating a certain user type. The four alternative combinations outperformed the baseline under every user scenario, and the truncation combination had the best performance under every user scenario. The main conclusion of the paper is that normalization and right-hand truncation (performed by a search expert) enhanced retrieval effectiveness in comparison to the baseline. The performance of the three normalization-based combinations was not far below that of the truncation combination.
Keywords: Base word form index; Discounted cumulated gain; Indexing strategy; Inflected word form index; Truncation; User scenario
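
To make the contrast between the baseline and the truncation combination concrete, the following is a minimal sketch of exact matching versus right-hand truncation against an inflected word form index. It is not the authors' system: the function names, the toy term list, and the use of simple prefix matching are all assumptions made for illustration.

```python
def exact_match(query_term, index_terms):
    """Baseline sketch: only the literal query term matches."""
    return sorted(t for t in index_terms if t == query_term)


def truncation_match(stem, index_terms):
    """Right-hand truncation sketch: 'stem*' matches every index term
    that begins with the given stem."""
    return sorted(t for t in index_terms if t.startswith(stem))


# Hypothetical inflected word form index: forms of Swedish "bil" (car),
# one compound ("bilverkstad", car repair shop), and unrelated words.
index_terms = {"bil", "bilen", "bilar", "bilarna", "bilverkstad", "bild", "hus"}

baseline_hits = exact_match("bil", index_terms)
truncated_hits = truncation_match("bil", index_terms)

print(baseline_hits)
print(truncated_hits)
```

In this toy index the truncated query retrieves the inflected forms and the compound that the exact query misses, but it also matches "bild" (picture), an unrelated word. Choosing the truncation point therefore involves a trade-off between coverage and false matches.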
This article is indexed in ScienceDirect and other databases.