首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 671 毫秒
1.
钱永胜 《情报科学》1992,13(6):47-50
本文就倒排档检索中的核心环节——检索提问式的逆波兰转换,提出了一种崭新的算法——二项拆分法,从逻辑式自身的性质出发,避开了常用的福岛法的判断算子优先级别的弊端,巧妙地实现了逆波兰转换,并在C—dBASEⅡ上得到了实现。  相似文献   

2.
优化倒排文档检索算法   总被引:1,自引:0,他引:1  
目前,国内对倒排文档的检索大多采用福岛法,以及在此基础上的一系列优化算法,如:预处理法[1]、准逆波兰变换法[2]、逆波兰回溯算法(RPRSA)[3]等.这些算法固然优化了福岛算法,却未能完全克服福岛法本身的缺陷.  相似文献   

3.
刘永丹 《情报科学》1993,14(4):35-39
本文针对《计算机情报检索的新算法及其在C—dBASEⅢ上的实现》一文中的可以避开算子优先级别的讨论实现逆波兰算法的观点提出了不同看法。然后提出了一种逆波兰算法的改进算法——括号加权法,并在dBASEⅢ、Fox Base和Fox Pro上得到了实现,证明是可行和高效的算法。  相似文献   

4.
徐文静  崔俊凯 《情报杂志》2005,24(12):65-67,70
研究了一套改进的基于XML架构的关系数据映射索引机制。针对DTD不足,设计了XML Schema算法,完成了关系数据向XML模式的完全转换,并在映射后的XML文档OEM树上运用了一种新的逆波兰链索引技术,同时利用Hash表存储结点指针,实现了查询处理代价的最小化O(n)。  相似文献   

5.
郭群 《黑龙江科技信息》2007,(8X):36-36,262
介绍数据结构学科中的一个重要领域——后缀表达式(逆波兰式)的求法。通过栈的应用、标识符树、扩号转换三种方法讨论后缀表达式实现方法以及它们的适用范围。  相似文献   

6.
介绍数据结构学科中的一个重要领域——后缀表达式(逆波兰式)的求法。通过栈的应用、标识符树、扩号转换三种方法讨论后缀表达式实现方法以及它们的适用范围。  相似文献   

7.
本文在大量调查研究的基础上,就军工科研院所转换运行机制,实现军民结合的模式选择问题进行了研究,分析了“亦军亦民”和“一所两制”两种结合模式利弊。  相似文献   

8.
基于图像检索的距离度量算法主要有两类:一种是最优化方法,其代表算法是旅行商距离(EMD);另一种是统计方法,其代表算法是渐进似然估计(ALA)距离.IALA图像检索算法利用分层的方法来提高检索精度,既克服了最优化方法EMD算法对混合成分发散的源图像的检索效果不佳的缺点,又对传统ALA算法对高斯混合模型方差较大的数据库图像产生误判的不足之处进行了改进.实验证明IALA图像检索算法可以大大提高检索的效率和精度.  相似文献   

9.
查全率-查准率间存在顺变关系的数学证明   总被引:7,自引:0,他引:7  
马景娣 《情报科学》2003,21(1):27-29
本文从理论上利用数学推导方法对查全率与查准率之间的关系进行了推导、论证,得出:不同的检索条件下,查全率与查准率之间可以呈现出逆变、顺变或不变的关系;并给出呈现不同关系时的条件关系式。文中同时对现有方面中仅得出“查全率-查准率互逆相关”结论的原因进行了剖析。  相似文献   

10.
针对传统的检索算法在互联网旅游资源检索中精确度不高的问题,本文提出了一种混合特征阈值抽取的互联网旅游资源检索算法。首先使用LLSF、kNN、Im-Rocchio算法计算个人特征矩阵,利用混合特征阈值抽取匹配策略提高检索的准确性,并在Rocchio算法的基础上进行算法优化,实现混合特征阈值抽取的类别匹配,最后采用PageRank搜索排序算法对匹配的结果进行排序,输出检索结果。实例仿真结果表明,通过本文提出的改进策略,大大提高了旅游资源检索的精确度。  相似文献   

11.
In this paper, a new source selection algorithm for uncooperative distributed information retrieval environments is presented. The algorithm functions by modeling each information source as an integral, using the relevance score and the intra-collection position of its sampled documents in reference to a centralized sample index and selects the collections that cover the largest area in the rank-relevance space. Based on the above novel metric, the algorithm explicitly focuses on addressing the two goals of source selection; high-recall, which is important for source recommendation applications and high-precision which is important for distributed information retrieval, aiming to produce a high-precision final merged list.  相似文献   

12.
We study several machine learning algorithms for cross-language patent retrieval and classification. In comparison with most of other studies involving machine learning for cross-language information retrieval, which basically used learning techniques for monolingual sub-tasks, our learning algorithms exploit the bilingual training documents and learn a semantic representation from them. We study Japanese–English cross-language patent retrieval using Kernel Canonical Correlation Analysis (KCCA), a method of correlating linear relationships between two variables in kernel defined feature spaces. The results are quite encouraging and are significantly better than those obtained by other state of the art methods. We also investigate learning algorithms for cross-language document classification. The learning algorithm are based on KCCA and Support Vector Machines (SVM). In particular, we study two ways of combining the KCCA and SVM and found that one particular combination called SVM_2k achieved better results than other learning algorithms for either bilingual or monolingual test documents.  相似文献   

13.
14.
Taxonomy is widely used in many of the website and directory navigation schemes for content/knowledge retrieval. However, information or content navigation support through taxonomy is often constrained due to its inability to take into account the full nomenclature and cultural nuances of knowledge seekers. The emergence and increasing adoption of collaborative tagging (social bookmarking) tools have provided lightweight and informal conceptual structures called folksonomies for knowledge retrieval. As for folksonomies, they reflect the vocabulary of the users. Hence, integrating folksonomies into a taxonomy combines the best of the two schemes as the resultant structure enhances taxonomy navigation with personsalisation for knowledge search and retrieval. This paper presents TaxoFolk, an algorithm for deriving hybrid taxonomy-folksonomy classification for enhanced knowledge navigation. The algorithm integrates folksonomy with a taxonomy through several unsupervised data mining techniques with augmented heuristics.  相似文献   

15.
Collaborative information retrieval involves retrieval settings in which a group of users collaborates to satisfy the same underlying need. One core issue of collaborative IR models involves either supporting collaboration with adapted tools or developing IR models for a multiple-user context and providing a ranked list of documents adapted for each collaborator. In this paper, we introduce the first document-ranking model supporting collaboration between two users characterized by roles relying on different domain expertise levels. Specifically, we propose a two-step ranking model: we first compute a document-relevance score, taking into consideration domain expertise-based roles. We introduce specificity and novelty factors into language-model smoothing, and then we assign, via an Expectation–Maximization algorithm, documents to the best-suited collaborator. Our experiments employ a simulation-based framework of collaborative information retrieval and show the significant effectiveness of our model at different search levels.  相似文献   

16.
在文本检索过程中充分利用词语之间的上下文关系有助于提高检索性能.首先对已有的相关工作进行综述;然后针对已有研究对词语上下文关系应用不足的现状,提出一种基于词语上下文关系的文本检索算法;最后通过实验对该算法进行验证.  相似文献   

17.
机器翻译中设计的两个算法   总被引:3,自引:2,他引:1  
在机器翻译的研究中,混合式方法是一种好方法。本文的工作主要有两部分:第一部分讨论直接散列检索算法;第二部分讨论近似机器翻译算法。  相似文献   

18.
采用在HSV彩色空间的色调累积直方图和边缘直方图分别表示颜色和形状特征,进行相似性检索,并结合综合权重调整的相关反馈技术来满足用户的检索需求。实验结果表明,此算法能获得有效的检索效果。  相似文献   

19.
基于情报检索的汉语同义词识别初探   总被引:3,自引:0,他引:3  
随着计算机的飞速发展,自然语言越来越广泛地应用于情报检索,同义词控制问题也成为情报学的研究热点。本文提出一种识别同义词的方法,这种方法基于检索网络搜索得到的统计数据,使用Dice测度方法测量两个词的相关度,相关度在给定的阈值内就可以认为是同义词。通过分析测试结果,验证这种方法的可行性,并提出了这种方法的优缺点及其应用。  相似文献   

20.
In the information retrieval systems, one of the most important and difficult operations is to extract appropriate keywords from documents. This paper proposes an effective substring search method by extending a pattern matching machine for multi-keyword based on Aho and Corasick (AC) called AC machine. The proposed method enables us to extract keyword candidates as much as possible and to select the suitable keywords for users' purpose at a retrieval stage. This method contains four types of substring search methods (exact, prefix, suffix and proper substring search). This paper also proposes a construction algorithm of the retrieval structure for speeding up the substring search. From the simulation results, it is shown that the retrieval time of the presented method is as fast as the key retrieval method based on the trie.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号