
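Distance-measure algorithms for image retrieval fall into two classes. The first is optimization-based, represented by the Earth Mover's Distance (EMD); the second is statistical, represented by the asymptotic likelihood approximation (ALA) distance. The IALA image retrieval algorithm uses a layered approach to improve retrieval precision. It overcomes the poor retrieval performance of the optimization-based EMD algorithm on source images whose mixture components are widely dispersed, and it corrects the tendency of the conventional ALA algorithm to misjudge database images whose Gaussian mixture models have large variances. Experiments show that the IALA image retrieval algorithm substantially improves both retrieval efficiency and retrieval precision.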

9.

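A mathematical proof that recall and precision can vary in the same direction (total citations: 7; self-citations: 0; citations by others: 7)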

马景娣《情报科学》2003,21(1):27-29

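This paper derives the relationship between recall and precision mathematically and concludes that, under different retrieval conditions, recall and precision can vary inversely, vary in the same direction, or remain unchanged. It gives the conditions under which each relationship holds. It also analyzes why existing work has arrived only at the conclusion that recall and precision are inversely related.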
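
The claim that recall and precision need not move in opposite directions can be checked with a small numeric example. The sketch below uses the standard set-based definitions (precision = hits / retrieved, recall = hits / relevant); the document IDs and the three retrieved sets are invented for illustration and are not taken from the paper.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) of a retrieved set against the relevant set."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical collection: documents 1-4 are relevant to the query.
relevant = {1, 2, 3, 4}

narrow = {1, 5, 6}               # baseline result set
refined = {1, 2, 5, 6}           # adds one relevant document
broad = {1, 2, 5, 6, 7, 8, 9}    # adds one relevant and three irrelevant documents

p_narrow, r_narrow = precision_recall(narrow, relevant)
p_refined, r_refined = precision_recall(refined, relevant)
p_broad, r_broad = precision_recall(broad, relevant)

# narrow -> refined: precision and recall both rise (same-direction change).
# narrow -> broad: recall rises while precision falls (inverse change).
print(p_narrow, r_narrow, p_refined, r_refined, p_broad, r_broad)
```

Which relationship appears depends on how many relevant and irrelevant documents a change in the retrieval condition adds, which is the kind of condition the paper formalizes.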

10.

An Internet tourism-resource retrieval algorithm based on hybrid feature threshold extraction

《科技通报》2017,(8)

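To address the low precision of conventional retrieval algorithms when applied to Internet tourism resources, this paper proposes a retrieval algorithm based on hybrid feature threshold extraction. The LLSF, kNN, and Im-Rocchio algorithms are first used to compute a personal feature matrix, and a matching strategy based on hybrid feature threshold extraction is applied to improve retrieval accuracy. The algorithm is then optimized on the basis of the Rocchio algorithm to perform category matching with hybrid feature threshold extraction. Finally, the PageRank ranking algorithm sorts the matched results to produce the retrieval output. Simulation on example data shows that the proposed improvements considerably raise the precision of tourism-resource retrieval.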

11.

Modeling information sources as integrals for effective and efficient source selection

Georgios Paltoglou, Michail Salampasis, Maria Satratzemi 《Information processing & management》2011

In this paper, a new source selection algorithm for uncooperative distributed information retrieval environments is presented. The algorithm models each information source as an integral, using the relevance score and the intra-collection position of its sampled documents in reference to a centralized sample index, and selects the collections that cover the largest area in the rank-relevance space. Based on this novel metric, the algorithm explicitly addresses the two goals of source selection: high recall, which is important for source recommendation applications, and high precision, which is important for distributed information retrieval, where the aim is to produce a high-precision final merged list.

12.

Advanced learning algorithms for cross-language patent retrieval and classification

Yaoyong Li, John Shawe-Taylor 《Information processing & management》2007

We study several machine learning algorithms for cross-language patent retrieval and classification. In comparison with most other studies involving machine learning for cross-language information retrieval, which basically used learning techniques for monolingual sub-tasks, our learning algorithms exploit the bilingual training documents and learn a semantic representation from them. We study Japanese–English cross-language patent retrieval using Kernel Canonical Correlation Analysis (KCCA), a method of correlating linear relationships between two variables in kernel-defined feature spaces. The results are quite encouraging and are significantly better than those obtained by other state-of-the-art methods. We also investigate learning algorithms for cross-language document classification. The learning algorithms are based on KCCA and Support Vector Machines (SVM). In particular, we study two ways of combining KCCA and SVM and find that one particular combination, called SVM_2k, achieves better results than the other learning algorithms for both bilingual and monolingual test documents.

13.

Enhancing information source selection using a genetic algorithm and social tagging

《International Journal of Information Management》2017,37(6):741-749


14.

TaxoFolk: a hybrid taxonomy–folksonomy classification for enhanced knowledge navigation

Ching-Chieh Kiu, Eric Tsui 《知识管理研究与实践》2010,8(1):24-32

Taxonomy is widely used in many website and directory navigation schemes for content and knowledge retrieval. However, navigation support through taxonomy is often constrained by its inability to take into account the full nomenclature and cultural nuances of knowledge seekers. The emergence and increasing adoption of collaborative tagging (social bookmarking) tools have provided lightweight and informal conceptual structures, called folksonomies, for knowledge retrieval. Folksonomies reflect the vocabulary of the users. Hence, integrating folksonomies into a taxonomy combines the best of the two schemes, as the resultant structure enhances taxonomy navigation with personalisation for knowledge search and retrieval. This paper presents TaxoFolk, an algorithm for deriving a hybrid taxonomy-folksonomy classification for enhanced knowledge navigation. The algorithm integrates a folksonomy with a taxonomy through several unsupervised data mining techniques with augmented heuristics.

15.

On domain expertise-based roles in collaborative information retrieval

Laure Soulier, Lynda Tamine, Wahiba Bahsoun 《Information processing & management》2014

Collaborative information retrieval involves retrieval settings in which a group of users collaborates to satisfy the same underlying need. One core issue of collaborative IR models involves either supporting collaboration with adapted tools or developing IR models for a multiple-user context and providing a ranked list of documents adapted for each collaborator. In this paper, we introduce the first document-ranking model supporting collaboration between two users characterized by roles relying on different domain expertise levels. Specifically, we propose a two-step ranking model. We first compute a document-relevance score that takes domain expertise-based roles into consideration, introducing specificity and novelty factors into language-model smoothing. We then assign documents to the best-suited collaborator via an Expectation–Maximization algorithm. Our experiments employ a simulation-based framework of collaborative information retrieval and show the significant effectiveness of our model at different search levels.

16.

A text retrieval algorithm based on contextual relations between words

郭少友《情报理论与实践》2008,31(4)

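Making full use of the contextual relations between words during text retrieval helps improve retrieval performance. The paper first surveys related work. It then proposes a text retrieval algorithm based on contextual relations between words, in response to the limited use of such relations in existing research. Finally, it validates the algorithm experimentally.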

17.

Two algorithms designed for machine translation (total citations: 3; self-citations: 2; citations by others: 1)

杨宪泽, 秦沿海, 唐向阳, 撒晓英, 刘明志 《科技通报》2005,21(2):189-192,197

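In machine translation research, the hybrid approach is an effective method. The work in this paper has two parts: the first discusses a direct hashing retrieval algorithm, and the second discusses an approximate machine translation algorithm.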

18.

Color image retrieval based on color and shape

文立《人天科学研究》2009,(7)

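Color and shape features are represented by a cumulative hue histogram in the HSV color space and by an edge histogram, respectively, and are used for similarity retrieval. A relevance feedback technique with combined weight adjustment is incorporated to meet the user's retrieval needs. Experimental results show that the algorithm retrieves images effectively.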
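
As a rough illustration of the color feature only, the sketch below builds a cumulative hue histogram with Python's standard `colorsys` module and compares two images by L1 distance. The bin count, the L1 distance, and the tiny synthetic "images" (lists of RGB tuples) are assumptions made for this example; the paper's edge histogram and relevance feedback steps are not covered.

```python
import colorsys


def hue_cumulative_histogram(pixels, bins=8):
    """Cumulative, normalized hue histogram of an iterable of (r, g, b) pixels (0-255)."""
    pixels = list(pixels)
    if not pixels:
        raise ValueError("image has no pixels")
    counts = [0] * bins
    for r, g, b in pixels:
        hue, _sat, _val = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        counts[min(int(hue * bins), bins - 1)] += 1
    cumulative, running = [], 0
    for count in counts:
        running += count
        cumulative.append(running / len(pixels))
    return cumulative


def l1_distance(hist_a, hist_b):
    """Sum of absolute bin differences; smaller means more similar."""
    return sum(abs(a - b) for a, b in zip(hist_a, hist_b))


# Synthetic four-pixel images, invented for illustration.
red_image = [(255, 0, 0)] * 4
reddish_image = [(255, 0, 0)] * 3 + [(255, 255, 0)]
blue_image = [(0, 0, 255)] * 4

query = hue_cumulative_histogram(red_image)
d_reddish = l1_distance(query, hue_cumulative_histogram(reddish_image))
d_blue = l1_distance(query, hue_cumulative_histogram(blue_image))
print(d_reddish, d_blue)  # the mostly red image is closer to the query than the blue one
```

Accumulating the histogram makes the distance sensitive to how far apart two hues are on the hue axis, not only to whether they fall in the same bin.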

19.

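A preliminary study of Chinese synonym recognition based on information retrieval (total citations: 3; self-citations: 0; citations by others: 3)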

刘华梅, 侯汉清 《情报理论与实践》2005,28(4):373-375,382

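With the rapid development of computing, natural language is used more and more widely in information retrieval, and synonym control has become an active research topic in information science. This paper proposes a method for recognizing synonyms. The method is based on statistics obtained from web searches and uses the Dice measure to compute the relatedness of two words; two words whose relatedness falls within a given threshold can be regarded as synonyms. The feasibility of the method is verified by analyzing test results, and its advantages, disadvantages, and applications are discussed.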
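
The Dice measure itself is simple to state: for two terms, it is twice the number of co-occurrences divided by the sum of the individual occurrence counts. The sketch below applies it to search hit counts. The counts, the function names, and the threshold of 0.5 are hypothetical; the abstract does not give the paper's threshold or data.

```python
def dice(hits_a, hits_b, hits_both):
    """Dice coefficient from hit counts: 2 * |A and B| / (|A| + |B|)."""
    if hits_a + hits_b == 0:
        return 0.0
    return 2.0 * hits_both / (hits_a + hits_b)


def is_synonym_candidate(hits_a, hits_b, hits_both, threshold=0.5):
    """Flag a word pair whose Dice score reaches the (assumed) threshold."""
    return dice(hits_a, hits_b, hits_both) >= threshold


# Hypothetical hit counts for two word pairs.
strong_pair = dice(1000, 800, 600)   # the words co-occur often
weak_pair = dice(1000, 800, 90)      # the words rarely co-occur

print(strong_pair, is_synonym_candidate(1000, 800, 600))
print(weak_pair, is_synonym_candidate(1000, 800, 90))
```

A high Dice score indicates only strong co-occurrence, so in practice a threshold would need tuning against known synonym pairs.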

20.

An efficient substring search method by using delayed keyword extraction

《Information processing & management》2001,37(5):741-761

In information retrieval systems, one of the most important and difficult operations is extracting appropriate keywords from documents. This paper proposes an effective substring search method that extends the multi-keyword pattern matching machine of Aho and Corasick, called the AC machine. The proposed method makes it possible to extract as many keyword candidates as possible and to select the keywords that suit the user's purpose at the retrieval stage. The method supports four types of substring search: exact, prefix, suffix, and proper substring search. The paper also proposes an algorithm for constructing the retrieval structure that speeds up the substring search. Simulation results show that the retrieval time of the presented method is as fast as that of the key retrieval method based on the trie.
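
For readers unfamiliar with the AC machine that the paper builds on, the following is a minimal sketch of the classic Aho-Corasick multi-keyword matcher (goto, failure, and output functions). It does not implement the paper's extension or its delayed keyword extraction; the function names and the sample keywords are chosen here for illustration.

```python
from collections import deque


def build_ac_machine(keywords):
    """Build goto, failure, and output tables for a set of keywords."""
    goto = [{}]     # goto[state][char] -> next state
    fail = [0]      # failure link of each state
    output = [[]]   # keywords recognized on entering each state
    for word in keywords:
        state = 0
        for ch in word:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                output.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        output[state].append(word)

    # Breadth-first traversal so that shallower failure links are ready first.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            output[nxt] = output[nxt] + output[fail[nxt]]
    return goto, fail, output


def search(text, machine):
    """Return (start_index, keyword) for every keyword occurrence in text."""
    goto, fail, output = machine
    state, matches = 0, []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for word in output[state]:
            matches.append((i - len(word) + 1, word))
    return matches


machine = build_ac_machine(["he", "she", "his", "hers"])
found = search("ushers", machine)
print(found)
```

The text is scanned once regardless of how many keywords are loaded, which is the property that makes the machine attractive as a base for substring search.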