首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 287 毫秒
1.
This paper describes the problem thematic structure of databases on studies of studies on religion that is formed at the Institute of Scientific Information on Social Sciences (INION) of the Russian Academy of Sciences. The principles of the informal processing of documents and queries based on a complex of linguistic tools of the system are uncovered. The problems of information searching using queries of different complexity are discussed.  相似文献   

2.
This paper describes the goals and tasks of creating a bibliographic database on sociology at the Institute of Scientific Information on Social Sciences (INION) of the Russian Academy of Sciences. Its problem and thematic structure, along with the specific and language composition of documents are determined. The basic principles of the analytic synthetic processing of documents and queries based on a set of linguistic tools on sociology are considered. The interdisciplinary access to information resources (domestic and foreign) is found to be necessary to improve the quality of scientific and information support to users of sociological information.  相似文献   

3.
This paper describes the features of the problem-thematic structure of the Science Studies Database, which was created at the Institute of Scientific Information on Social Sciences (INION) of the Russian Academy of Sciences. A complex of linguistic tools for science studies, which was developed for analytical and synthetic processing of documents and queries, is described. The features of the formation of information search strategies are considered.  相似文献   

4.
This paper considers the properties of using the multidimensional approach in the information support of scientific research on ethnology and historical anthropology based on the information resources of the Institute of Scientific Information on Social Sciences (INION). The principles of the content processing of documents and queries based on a complex of linguistic tools of the Automated Information System on Social Sciences (AISON) are considered. The problems of increasing the professional competence and level of information culture of consumers who search for ethnological information by queries of different complexities are discussed.  相似文献   

5.
This paper analyzes aims and objectives that are related to the development of a bibliographic database on philosophy. This database has been developed by the Institute of Scientific Information on Social Sciences (INION) of the Russian Academy of Sciences for more than 30 years. The topical structure of the branch database for philosophy is described. The principles that underpin the content processing of documents and queries based on a set of linguistic tools are presented. It is noted that it is necessary to provide access to external (domestic and foreign) electronic resources in order to improve the quality of research and information support that is provided to the consumers of information in the field of philosophy.  相似文献   

6.
The paper studies concept-based cross-language information retrieval (CLIR). The document collection was a subset of the TREC collection. The test requests were formed from TREC's health related topics. As translation dictionaries the study used a general dictionary and a domain-specific (=medical) dictionary. The effects of translation method, conjunction, and facet order on the effectiveness of concept-based cross-language queries were studied, and concept-based structuring of cross-language queries was compared to mechanical structuring based on the output of dictionaries. The performance of translated Finnish queries against English documents was compared to the performance of original English queries against the English documents, and the performance of different CLIR query types was compared with one another. No major difference was found between concept-based and mechanical structuring. The best translation method was a simultaneous look-up in the medical dictionary and the general dictionary, in which case cross-language queries performed as well as the original English queries. The results showed that especially at high exhaustivity (the number of mutually restrictive concepts in a request) levels cross-language queries perform well in relation to monolingual queries. This suggests that conjunction disambiguates cross-language queries. An extensive study was made of the relative importance of the concepts of requests. On the basis of the classification data of request concepts it was shown how the order of facets in a query affects cross-language as well as monolingual queries.  相似文献   

7.
This study investigates the information seeking behavior of general Korean Web users. The data from transaction logs of selected dates from August 2006 to August 2007 were used to examine characteristics of Web queries and to analyze click logs that consist of a collection of documents that users clicked and viewed for each query. Changes in search topics are explored for NAVER users from 2003/2004 to 2006/2007. Patterns involving spelling errors and queries in foreign languages are also investigated. Search behaviors of Korean Web users are compared to those of the United States and other countries. The results show that entertainment is the topranked category, followed by shopping, education, games, and computer/Internet. Search topics changed from computer/Internet to entertainment and shopping from 2003/2004 to 2006/2007 in Korea. The ratios of both spelling errors and queries in foreign languages are low. This study reveals differences for search topics among different regions of the world. The results suggest that the analysis of click logs allows for the reduction of unknown or unidentifiable queries by providing actual data on user behaviors and their probable underlying information needs. The implications for system designers and Web content providers are discussed.  相似文献   

8.
9.
旨在实现对给定的实体对象集匹配出尽可能宽的实体对象面,以帮助用户快速找到相关信息,尤其是那些需动态整合的特定领域的语义关联信息。分析Web文档中的实体对象结构及其关系,并借助Schema.org方案中的语义分类思想,提出构建具有语义特性的实体对象数据库建设方案。基于该数据库提出一个自适应的实体对象检索框架,该框架能对用户的查询意图进行分析并进行语义分类,形成一条条涵盖实体对象的查询语句,接着“智能地”选择、执行某些具有优先权的查询语句以匹配出那些保存在事实数据库中的相关实体对象。本研究旨在一定程度上实现“滚雪球”式的高效检索思想,满足智能检索技术的需求,促进以实体对象作为研究对象的情报理论研究工作的开展,并为智能情报检索技术的应用规划提供有用参考。  相似文献   

10.
The retrieval of sentences that are relevant to a given information need is a challenging passage retrieval task. In this context, the well-known vocabulary mismatch problem arises severely because of the fine granularity of the task. Short queries, which are usually the rule rather than the exception, aggravate the problem. Consequently, effective sentence retrieval methods tend to apply some form of query expansion, usually based on pseudo-relevance feedback. Nevertheless, there are no extensive studies comparing different statistical expansion strategies for sentence retrieval. In this work we study thoroughly the effect of distinct statistical expansion methods on sentence retrieval. We start from a set of retrieved documents in which relevant sentences have to be found. In our experiments different term selection strategies are evaluated and we provide empirical evidence to show that expansion before sentence retrieval yields competitive performance. This is particularly novel because expansion for sentence retrieval is often done after sentence retrieval (i.e. expansion terms are mined from a ranked set of sentences) and there are no comparative results available between both types of expansion. Furthermore, this comparison is particularly valuable because there are important implications in time efficiency. We also carefully analyze expansion on weak and strong queries and demonstrate clearly that expanding queries before sentence retrieval is not only more convenient for efficiency purposes, but also more effective when handling poor queries.  相似文献   

11.
The topical structure of the information array on socio-economic, political and humanitarian problems of Ukrainian development, reflected in the databases on social sciences and humanities of the Institute of Scientific Information on Social Sciences (INION) of the Russian Academy of Sciences is described. The specificities of country-oriented database which would provide information on Ukrainian studies, are discussed. The most important aspects of investigation aimed at information support of Russian-Ukrainian cooperation in various spheres are analyzed.  相似文献   

12.
从Sogou查询日志中选取样本查询且进行人工标注,通过对标注后新闻查询的分析,提出能用于识别新闻意图的新特征,即查询表达式特征、查询随时间分布特征以及点击结果特征。根据这3个特征,利用决策树分类器实现查询中新闻意图的自动识别,结果发现:①新闻类查询的查询目标主要集中在特定主题信息以及娱乐类信息方面,其查询主题大多为娱乐、政治、体育与经济类信息;②相对非新闻查询,新闻查询具有更可能包含实体、随时间分布波动较大、点击结果之间相似度更高的特点;③本方法对查询中新闻意图的识别效果较好,其宏平均准确率、召回率、F值分别为 0.76、0.73、0、74。  相似文献   

13.
A better understanding of users' search interactions in library search systems is key to improving the result ranking. By focusing on known-item searches (searches for an item already known) and search tactics, vast improvement can be made. To better understand user behaviour, we conducted four transaction-log studies, comprising more than 4.2 million search sessions from two German library search systems. Results show that most sessions are rather short; users tend to issue short queries and usually do not go beyond the first search engine result page (SERP). The most frequently used search tactic was the extension of a query (‘Exhaust’). Looking at the known-item searches, it becomes clear that this query type is of great importance. Between 38%–57% of all queries are known-item queries. Titles or title parts were the most frequent elements of these queries, either alone or in combination with the author's name. Unsuccessful known-item searches were often caused by items not available in the system. Results can be applied by libraries and library system vendors to improve their systems, as well as when designing new systems. Future research, in addition to log data, should also include background information on the usage, for example, through user surveys.  相似文献   

14.
This paper describes the topical-thematic structure of the “Political Science” Database created at the Institute for Scientific Information on Social Sciences (INION) of the RAS. The principles of conceptual processing of political science documents are considered. The properties are uncovered of how the complex of linguistic instruments (branch information-retrieval thesaurus, rubricator of political science, and subject headings) is developed and applied to indexing and classification of documents and queries.  相似文献   

15.
The problems of facilitating the processing of context queries in information retrieval systems (IRS) with proprietary data repositories are considered. A method for indexing fields with a large spread in values is proposed and its efficiency for real bases of information on abstracts is demonstrated. A method involving the use of SQL expressions within the IRS language is presented, as well as the processing of combined queries.  相似文献   

16.
This paper addresses the problem of estimating the size of a deep web data source that is accessible by queries only. Since most deep web data sources are non-cooperative, a data source size can only be estimated by sending queries and analyzing the returning results. We propose an efficient estimator based on the capture–recapture method. First we derive an equation between the overlapping rate and the percentage of the data examined when random samples are retrieved from a uniform distribution. This equation is conceptually simple and leads to the derivation of an estimator for samples obtained by random queries. Since random queries do not produce random documents, it is well known that the traditional methods by random queries underestimate the size, i.e., those estimators have negative bias. Based on the simple estimator for random samples, we adjust the equation so that it can handle the samples returned by random queries. We conduct both simulation studies and experiments on corpora including Gov2, Reuters, Newsgroups, and Wikipedia. The results show that our method has small bias and standard deviation.  相似文献   

17.
This paper reflects the results of the analysis of user information needs that was carried out by the Library of Natural Sciences of the Russian Academy of Sciences (LNS RAS) for the purpose of gathering information on the quality of a reference-bibliographic and information service for specialists of the RAS in the subject of their studies.  相似文献   

18.
Query recommendation has long been considered a key feature of search engines, which can improve users’ search experience by providing useful query suggestions for their search tasks. Most existing approaches on query recommendation aim to recommend relevant queries, i.e., alternative queries similar to a user’s initial query. However, the ultimate goal of query recommendation is to assist users to reformulate queries so that they can accomplish their search task successfully and quickly. Only considering relevance in query recommendation is apparently not directly toward this goal. In this paper, we argue that it is more important to directly recommend queries with high utility, i.e., queries that can better satisfy users’ information needs. For this purpose, we attempt to infer query utility from users’ sequential search behaviors recorded in their search sessions. Specifically, we propose a dynamic Bayesian network, referred as Query Utility Model (QUM), to capture query utility by simultaneously modeling users’ reformulation and click behaviors. We then recommend queries with high utility to help users better accomplish their search tasks. We empirically evaluated the performance of our approach on a publicly released query log by comparing with the state-of-the-art methods. The experimental results show that, by recommending high utility queries, our approach is far more effective in helping users find relevant search results and thus satisfying their information needs.  相似文献   

19.
In Information Retrieval, since it is hard to identify users’ information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users’ relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users’ information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users’ relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.  相似文献   

20.
《The Reference Librarian》2013,54(22):271-282
A significant problem in reference service at the present time is the shortage of reference librarians. The work of answering reference queries places great demands on such professionals. Success in answering reference questions is mainly based on the level of the librarian's knowledge and experience. The traditional method of answering queries emphasizes experience and mental "detective work" based on sources familiar to the librarian from previous searches. This is extremely difficult for the novice librarian, who usual1 lacks a history of exposure to the literature upon which to draw. The following paper presents a method for addressing queries by means of secondary reference books, referred to as the SRB method. It describes the process of answering queries explains the nature of SRBs, and introduces a method an techn~que for answering queries by consulting SRBs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号