Similar literature
 19 similar documents found
1.
陈洁 《情报探索》2020,(2):114-119
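[Purpose/Significance] To provide a reference for research on relevance in information retrieval. [Method/Process] Using CNKI as the data source, the paper takes a qualitative approach, reviews the historical development and research schools of information retrieval, and analyzes its influencing factors and development trends. [Result/Conclusion] Relevance in information retrieval combines user relevance and system relevance, and neither can be considered in isolation. Relevance research should treat the user as the key and the system as the foundation, studying users' interaction with the retrieval system, their cognition, and the description of and feedback on their real needs. As research on retrieval relevance deepens, the system view and the user view will merge, and retrieval technology and user needs will be brought into harmony, jointly advancing retrieval relevance.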

2.
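Design of an information retrieval system based on the vector space model   Total citations: 8, self-citations: 1, citations by others: 7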
包金龙 《情报杂志》2005,24(7):44-45,49
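The vector space retrieval model was first proposed in 1958 and performs well in information retrieval systems. The paper first introduces the model, then designs an information retrieval system on top of it in three steps: document indexing, determination of index-term weights, and similarity computation. It summarizes the core technologies the system requires and evaluates the system.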
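The three steps named in this abstract (indexing, term weighting, similarity computation) can be illustrated with a minimal sketch. It is not the system described in the paper: whitespace tokenization, the TF-IDF weighting formula, and all function names below are assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization stands in for real indexing.
    return text.lower().split()


def weight(tokens, idf):
    # Step 2: term weighting. Raw term frequency times IDF; terms absent
    # from the collection are ignored.
    tf = Counter(tokens)
    return {t: c * idf[t] for t, c in tf.items() if t in idf}


def build_index(docs):
    # Step 1: indexing. Returns one sparse TF-IDF vector per document
    # together with the IDF table used to weight queries.
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(t for toks in tokenized for t in set(toks))
    idf = {t: math.log(n / df_t) + 1.0 for t, df_t in df.items()}
    return [weight(toks, idf) for toks in tokenized], idf


def cosine(u, v):
    # Step 3: similarity. Cosine of the angle between two sparse vectors.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, docs):
    # Rank documents by cosine similarity to the query, best first.
    vectors, idf = build_index(docs)
    q = weight(tokenize(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))
```

Calling `search("vector space", docs)` returns `(score, document index)` pairs, so the caller can cut the list at any rank or score threshold.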

3.
孙鑫 《情报杂志》2005,24(3):116-117
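Relevance feedback is an important technique in information retrieval and is highly effective at improving retrieval quality. The paper discusses user queries and relevance feedback, the standard relevance feedback interface, and how to obtain relevance information.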

4.
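Relevance is a fundamental concept in information science research. When users obtain information over the network, judging the relevance of that information is the core step of their information selection behavior. The article analyzes the factors that influence relevance judgments and argues that order effects have some influence on how users judge the relevance of retrieval results. Understanding this influence is valuable both for designing information retrieval systems and for improving users' retrieval efficiency.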

5.
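A large amount of open-source code is available online, and reading it manually is time-consuming and laborious. Information retrieval techniques can automatically analyze code snippets and generate descriptions, making the code more efficient and intuitive to read. For each function to be described, this paper applies existing information retrieval models for text classification and text similarity matching, combines the results with the code descriptions in a retrieval corpus, and builds a Django application that returns a code summary for any function to a web page.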

6.
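Relevance is a relatively popular research topic in information science, particularly in information retrieval. Focusing on relevance evaluation, the paper briefly reviews the state of research, then analyzes binary improved algorithms for recall and precision, the two basic measures of retrieval result relevance, along with computation methods based on the Boolean, vector space, and probabilistic models. From both the system and user perspectives, it proposes improvements for computing information retrieval relevance more accurately.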
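For reference, the two basic measures mentioned here have standard set-based definitions: precision is the share of retrieved documents that are relevant, and recall is the share of relevant documents that were retrieved. The sketch below shows only these textbook definitions under binary relevance judgments; it does not reproduce the improved algorithms analyzed in the paper, and the function name is an assumption.

```python
def precision_recall(retrieved, relevant):
    # Binary relevance: a document is either relevant or it is not.
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    # Precision: fraction of retrieved documents that are relevant.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    # Recall: fraction of relevant documents that were retrieved.
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall
```

With four documents retrieved, two of which are among three relevant documents, precision is 2/4 and recall is 2/3.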

7.
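Research on information retrieval models and relevance algorithms   Total citations: 2, self-citations: 0, citations by others: 2
The paper describes the main topics and construction strategies in information retrieval model research, presents several commonly used relevance algorithms for information retrieval models, analyzes their strengths and weaknesses, discusses open problems, and summarizes the current state and development trends of information retrieval model research.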

8.
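Research on user models for personalized information retrieval systems   Total citations: 1, self-citations: 0, citations by others: 1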
李爱明  刘冰 《情报杂志》2007,26(3):121-123,126
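The paper analyzes why traditional information retrieval systems must move toward personalized retrieval, discusses related issues such as information agents and user models in personalized information retrieval systems, and proposes a vector-space-based user model for personalized information retrieval.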

9.
成全  司辉 《情报杂志》2008,27(2):129-133
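Relevance is one of the key measures of information retrieval system performance, and an in-depth analysis of retrieval relevance offers important theoretical guidance for developing effective retrieval systems. The paper analyzes evaluation methods and improvement strategies for retrieval relevance from three angles: the system, the user, and information retrieval in the current network environment.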

10.
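Research progress on statistical language models applied to information retrieval   Total citations: 1, self-citations: 0, citations by others: 1
Statistical language models, as a tool for natural language processing, have proven capable of handling large-scale real-world text. The SLM-IR model, formed by combining statistical language models with IR, is a major advance in information retrieval model research. This paper introduces the basic statistical language models used in information retrieval and related issues, focuses on the principles and models of the Lemur toolkit and the title language model, and concludes with an overview of international developments and research progress in the field.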
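One common way statistical language models are applied to retrieval is query-likelihood ranking: each document is scored by the probability that its language model generates the query. The sketch below uses Jelinek-Mercer smoothing against the whole collection. It is a generic illustration, not the Lemur toolkit's API and not the title language model discussed in the paper; the function names, whitespace tokenization, and default `lam=0.5` are assumptions.

```python
import math
from collections import Counter


def query_log_likelihood(query_tokens, doc_tokens, coll_counts, coll_len, lam=0.5):
    # log P(query | document) under a unigram model, where each term
    # probability is interpolated with the collection model:
    #   p(t) = (1 - lam) * p(t | doc) + lam * p(t | collection)
    doc_counts = Counter(doc_tokens)
    doc_len = len(doc_tokens)
    score = 0.0
    for t in query_tokens:
        p_doc = doc_counts[t] / doc_len if doc_len else 0.0
        p_coll = coll_counts[t] / coll_len if coll_len else 0.0
        p = (1.0 - lam) * p_doc + lam * p_coll
        if p == 0.0:
            # The term never occurs in the collection, so no smoothing helps.
            return float("-inf")
        score += math.log(p)
    return score


def rank(query, docs, lam=0.5):
    # Assumption: lowercase whitespace tokenization.
    tokenized = [d.lower().split() for d in docs]
    coll_counts = Counter(t for toks in tokenized for t in toks)
    coll_len = sum(len(toks) for toks in tokenized)
    q = query.lower().split()
    scored = [
        (query_log_likelihood(q, toks, coll_counts, coll_len, lam), i)
        for i, toks in enumerate(tokenized)
    ]
    # Highest log-likelihood first; ties broken by document index.
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))
```

Smoothing is what lets a document that lacks a query term still receive a finite score, as long as the term occurs somewhere in the collection.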

11.
How to merge and organise query results retrieved from different resources is one of the key issues in distributed information retrieval. Some previous research and experiments suggest that cluster-based document browsing is more effective than a single merged list. Cluster-based presentation of retrieval results rests on the cluster hypothesis, which states that documents that cluster together have a similar relevance to a given query. However, while this hypothesis has been demonstrated to hold in classical information retrieval environments, it has never been fully tested in heterogeneous distributed information retrieval environments. Heterogeneous document representations, the presence of document duplicates, and disparate qualities of retrieval results are major features of a heterogeneous distributed information retrieval environment that might undermine the cluster hypothesis. In this paper we report on an experimental investigation into the validity and effectiveness of the cluster hypothesis in highly heterogeneous distributed information retrieval environments. The results show that although clustering is affected by differences in the representation and quality of retrieval results, the cluster hypothesis still holds, and that generating hierarchical clusters in highly heterogeneous distributed information retrieval environments is still a very effective way of presenting retrieval results to users.

12.
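Research on information retrieval relevance based on relevance theory   Total citations: 1, self-citations: 0, citations by others: 1
The paper uses the information processing model of relevance theory to explain the relevance assessment module in the interactive model of information retrieval, and argues that it is feasible to explain the relevance judgment process in information retrieval with this model.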

13.
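Research on information retrieval relevance based on relevance theory: information production and indexing   Total citations: 1, self-citations: 0, citations by others: 1
Building on the work of Saracevic and Harter, the paper proposes relevance theory from linguistics as the theoretical basis for relevance research, and uses it to explain two tasks in the interactive model of information retrieval: information production and information indexing.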

14.
It is well known that relevance feedback is a significant method for improving the effectiveness of information retrieval systems. Improving effectiveness is important since these information retrieval systems must gain access to large document collections distributed over different distant sites. As a consequence, the effort needed to retrieve relevant documents has become significantly greater. Relevance feedback can be viewed as an aid to the information retrieval task. In this paper, a relevance feedback strategy is presented. The strategy is based on back-propagation of the relevance of retrieved documents using an algorithm developed in a neural approach. This paper describes a neural information retrieval model and emphasizes the results obtained with the associated relevance back-propagation algorithm in three different environments: manual ad hoc, automatic ad hoc and mixed ad hoc strategy (automatic plus manual ad hoc).

15.
Application of relevance feedback techniques in knowledge retrieval   Total citations: 4, self-citations: 1, citations by others: 4
艾丹祥  张玉峰 《情报科学》2003,21(10):1100-1103
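Starting from the basic principles of relevance feedback and drawing on both information management and machine learning, this paper discusses application patterns of relevance feedback in knowledge retrieval, relevance feedback retrieval algorithms, and relevance feedback learning algorithms, and closes with suggestions for research in this area.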

16.
Lately there has been intensive research into the possibilities of using additional information about documents (such as hyperlinks) to improve retrieval effectiveness. This is called data fusion, and it is based on the intuitive principle that combining different document and query representations or different methods leads to a better estimation of the documents' relevance scores. In this paper we propose a new method of document re-ranking that enables us to improve document scores using inter-document relationships. These relationships are expressed by distances and can be obtained from the text, hyperlinks or other information. The method formalizes the intuition that strongly related documents should not be assigned very different weights.
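The intuition in this abstract, that strongly related documents should not receive very different scores, can be illustrated with a simple score-smoothing sketch. This is not the method proposed in the paper, whose formulation the abstract does not give. The `exp(-distance)` weighting, the mixing parameter `alpha`, the iteration count, and the function name are all assumptions.

```python
import math


def smooth_scores(scores, distances, alpha=0.5, iterations=10):
    # scores[i]: initial retrieval score of document i.
    # distances[i][j]: inter-document distance; small means strongly related.
    n = len(scores)
    # Assumption: turn distances into neighbor weights with exp(-d).
    weights = [
        [math.exp(-distances[i][j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    current = list(scores)
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total = sum(weights[i])
            if total == 0.0:
                # No neighbors: keep the original score.
                updated.append(scores[i])
                continue
            neighbor_avg = sum(w * s for w, s in zip(weights[i], current)) / total
            # Mix the original score with the weighted average of neighbors.
            updated.append((1.0 - alpha) * scores[i] + alpha * neighbor_avg)
        current = updated
    return current
```

Documents are then re-ranked by the smoothed scores. A larger `alpha` pulls closely related documents toward each other more strongly, while `alpha=0` leaves the original ranking unchanged.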

17.
In this paper we propose an effective sentence retrieval method that incorporates query-independent features into standard sentence retrieval models. To this end, we apply a formal methodology and consider different query-independent features. In particular, we show that opinion-based features are promising. Opinion mining is an increasingly important research topic, but little is known about how to improve retrieval algorithms with opinion-based components. We therefore consider different kinds of opinion-based features as query-independent evidence and study whether incorporating them improves retrieval performance. In addition, information needs are usually related to people, locations or organizations. We hypothesize that using these named entities as query-independent features may also improve the estimation of sentence relevance. Finally, the length of the retrieval unit has been shown to be an important component in different retrieval scenarios. We therefore include length-based features in our study.

18.
苏君华  周林兴 《情报科学》2005,23(8):1218-1220
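By studying the relationships between attention and search engines, between attention and online databases, and between attention and the relevance of retrieval results, this paper demonstrates the interaction between attention and network information retrieval and its effects.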

19.
Although relevance judgments are fundamental to the design and evaluation of all information retrieval systems, information scientists have not reached a consensus in defining the central concept of relevance. In this paper we ask two questions: What is the meaning of relevance? and What role does relevance play in information behavior? We attempt to address these questions by reviewing literature over the last 30 years that presents various views of relevance as topical, user-oriented, multidimensional, cognitive, and dynamic. We then discuss traditional assumptions on which most research in the field has been based and begin building a case for an approach to the problem of definition based on alternative assumptions. The dynamic, situational approach we suggest views the user, regardless of system, as the central and active determinant of the dimensions of relevance. We believe that relevance is a multidimensional concept; that it is dependent on both internal (cognitive) and external (situational) factors; that it is based on a dynamic human judgment process; and that it is a complex but systematic and measurable phenomenon.
