
1.

Reducing 0s bias in video moment retrieval with a circular competence-based captioner

《Information processing & management》2023,60(2):103147

The current study addresses the problem of retrieving a specific moment from an untrimmed video by a sentence query. Existing methods have achieved high performance by designing various structures to match visual-text relations. Yet, these methods tend to return an interval starting from 0s, which we name “0s bias”. In this paper, we propose a Circular Co-Teaching (CCT) mechanism using a captioner to improve an existing retrieval model (localizer) from two aspects: biased annotations and easy samples. Correspondingly, CCT contains two processes: (1) Pseudo Query Generation (captioner to localizer), aiming at transferring the knowledge from generated queries to the localizer to balance annotations; (2) Competence-based Curriculum Learning (localizer to captioner), training the captioner in an easy-to-hard fashion guided by localization results, making pairs of the false-positive moment and pseudo query become easy samples for the localizer. Extensive experiments show that our CCT can alleviate “0s bias”, with an improvement of as much as 4% on average for existing approaches on two public datasets (ActivityNet-Captions and Charades-STA), in terms of R@1, IoU=0.7. Notably, our method also outperforms baselines in an out-of-distribution scenario. We also quantitatively validate CCT’s ability to cope with “0s bias” by a proposed metric, DM. Our study not only theoretically contributes to detecting “0s bias”, but also provides a highly effective tool for video moment retrieval by alleviating such bias.
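The abstract reports results as R@1, IoU=0.7. As a minimal sketch of how that figure is commonly computed (this is the standard definition, not code from the paper; the function names and the `(start, end)` tuple layout are assumptions made for illustration), a query counts as a hit when its top-ranked interval overlaps the annotated interval with a temporal IoU of at least 0.7:

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start_seconds, end_seconds), assumed layout


def temporal_iou(pred: Interval, gold: Interval) -> float:
    """Intersection over union of two time intervals."""
    inter = max(0.0, min(pred[1], gold[1]) - max(pred[0], gold[0]))
    union = (pred[1] - pred[0]) + (gold[1] - gold[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_1(top1: List[Interval], gold: List[Interval],
                threshold: float = 0.7) -> float:
    """Fraction of queries whose top-ranked interval reaches the IoU threshold."""
    hits = sum(temporal_iou(p, g) >= threshold for p, g in zip(top1, gold))
    return hits / len(gold) if gold else 0.0
```

A localizer that always answers with an interval starting at 0s would score well under this metric only when the annotations themselves cluster at the start of the video, which is the kind of bias the abstract describes.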

2.

A microprocessor architecture for bibliographic retrieval system

G. Martella G. Gobbi 《Information processing & management》1981,17(5):239-247

In this paper the features of a microprocessor-based architecture for a bibliographic retrieval system are illustrated. The proposed system consists of the following three functional blocks: the “query processor”, the “simple query executers” and the “answer composer”. The query processor parses the queries and breaks the complex query into simple queries. Each simple query executer is able to perform the operations satisfying a simple query. Finally, the answer composer puts together the results of all simple query executers and produces the response to the query originally raised. This machine will allow the implementation of a very powerful query language. The basic design goals are system modularity and the fulfilment of arbitrarily complex queries. This is achieved through the proposed query language and by means of the system architecture, which allows high parallelism in the performed operations.

3.

GTE-Rank: A time-aware search engine to answer time-sensitive queries

《Information processing & management》2016,52(2):273-298

In the web environment, most of the queries issued by users are implicit by nature. Inferring the different temporal intents of this type of query enhances the overall temporal part of the web search results. Previous works tackling this problem usually focused on news queries, where the retrieval of the most recent results related to the query is usually sufficient to meet the user's information needs. However, few works have studied the importance of time in queries such as “Philip Seymour Hoffman” where the results may require no recency at all. In this work, we focus on this type of query, named “time-sensitive queries”, where the results are preferably from a diversified time span, not necessarily the most recent one. Unlike related work, we follow a content-based approach to identify the most important time periods of the query and integrate time into a re-ranking model to boost the retrieval of documents whose contents match the query time period. For that purpose, we define a linear combination of topical and temporal scores, which reflects the relevance of any web document both in the topical and temporal dimensions, thus contributing to improve the effectiveness of the ranked results across different types of queries. Our approach relies on a novel temporal similarity measure that is capable of determining the most important dates for a query, while filtering out the non-relevant ones. Through extensive experimental evaluation over web corpora, we show that our model offers promising results compared to baseline approaches. As a result of our investigation, we publicly provide a set of web services and a web search interface so that the system can be graphically explored by the research community.
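The re-ranking step is described only as a linear combination of topical and temporal scores. A minimal sketch of that general idea follows; the weight `alpha`, the function names and the `(doc_id, topical, temporal)` tuple layout are assumptions for illustration, and the paper's actual scores and weighting are not given in the abstract:

```python
from typing import List, Tuple

ScoredDoc = Tuple[str, float, float]  # (doc_id, topical_score, temporal_score)


def combined_score(topical: float, temporal: float, alpha: float = 0.5) -> float:
    """Weighted sum of the two scores; alpha is a hypothetical mixing weight."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * topical + (1.0 - alpha) * temporal


def rerank(docs: List[ScoredDoc], alpha: float = 0.5) -> List[ScoredDoc]:
    """Order documents by the combined score, highest first."""
    return sorted(docs, key=lambda d: combined_score(d[1], d[2], alpha),
                  reverse=True)
```

With `alpha = 1.0` the ordering falls back to purely topical ranking, so the weight controls how strongly a match with the query's time periods can promote a document.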

4.

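Research on a grammar-based information retrieval model: a new branch of information retrieval methods

Shu Jiangbo, Hu Jinzhu, Xiao Sheng 《情报理论与实践》 (Information Studies: Theory & Application) 2011,34(4)

Because current keyword-based and semantics-based information retrieval both attend only to the query focus, the information retrieved is excessive, heterogeneous and imprecise. This paper proposes a grammar-based information retrieval model which, by examining the degree of association among the query focus, the associated clues and the answer subject, can obtain the answer the user expects fairly precisely. The model can be regarded as an extension of, and a complement to, shallow semantics-based information retrieval.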

5.

A conceptual framework for fuzzy query processing—A step toward very intelligent database systems

Valiollah Tahani 《Information processing & management》1977,13(5):289-303

This paper is concerned with techniques for fuzzy query processing in a database system. By a fuzzy query we mean a query which uses imprecise or fuzzy predicates (e.g. AGE = “VERY YOUNG”, SALARY = “MORE OR LESS HIGH”, YEAR-OF-EMPLOYMENT = “RECENT”, SALARY ? 20,000, etc.). As a basis for fuzzy query processing, a fuzzy retrieval system based on the theory of fuzzy sets and linguistic variables is introduced. In our system model, the first step in processing fuzzy queries consists of assigning meaning to fuzzy terms (linguistic values), of a term-set, used for the formulation of a query. The meaning of a fuzzy term is defined as a fuzzy set in a universe of discourse which contains the numerical values of a domain of a relation in the system database. The fuzzy retrieval system developed is a high level model for the techniques which may be used in a database system. The feasibility of implementing such techniques in a real environment is studied. Specifically, within this context, techniques for processing simple fuzzy queries expressed in the relational query language SEQUEL are introduced.
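To make the notion of a fuzzy predicate concrete, here is a minimal sketch of evaluating AGE = “VERY YOUNG” over a few rows. Everything specific in it is an assumption for illustration: the shape of the `young` membership function (full membership up to 25, falling linearly to zero at 45), the modelling of “very” as squaring (a common convention in fuzzy set theory, not confirmed by the abstract), and the helper names.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]


def young(age: float) -> float:
    """Hypothetical membership of an age in the fuzzy set YOUNG."""
    if age <= 25:
        return 1.0
    if age >= 45:
        return 0.0
    return (45 - age) / 20


def very(membership: float) -> float:
    """Linguistic hedge 'very', modelled here as squaring the membership."""
    return membership ** 2


def fuzzy_select(rows: List[Row], field: str,
                 predicate: Callable[[float], float],
                 threshold: float = 0.0) -> List[Tuple[Row, float]]:
    """Return rows whose membership exceeds the threshold, best match first."""
    scored = [(row, predicate(row[field])) for row in rows]
    matches = [(row, mu) for row, mu in scored if mu > threshold]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)
```

Calling `fuzzy_select(rows, "AGE", lambda a: very(young(a)))` returns each matching row together with its degree of membership, so the answer to a fuzzy query is a graded list and not a yes/no set.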

6.

Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching

Mohammed Belkhatir Bashar Tahayna 《Information processing & management》2012

We propose in this paper an architecture for near-duplicate video detection based on: (i) index and query signature based structures integrating temporal and perceptual visual features and (ii) a matching framework computing the logical inference between index and query documents. As far as indexing is concerned, instead of concatenating low-level visual features in high-dimensional spaces, which results in curse of dimensionality and redundancy issues, we adopt a perceptual symbolic representation based on color and texture concepts. For matching, we propose to instantiate a retrieval model based on logical inference through the coupling of an N-gram sliding window process and theoretically-sound lattice-based structures. The techniques we cover are robust and insensitive to general video editing and/or degradation, making them ideal for re-broadcasted video search. Experiments are carried out on large quantities of video data collected from the TRECVID 02, 03 and 04 collections and real-world video broadcasts recorded from two German TV stations. An empirical comparison over two state-of-the-art dynamic programming techniques is encouraging and demonstrates the advantage and feasibility of our method.

7.

Automatic suggestion of phrasal-concept queries for literature search

Youngho Kim Jangwon Seo W. Bruce Croft David A. Smith 《Information processing & management》2014

Both general and domain-specific search engines have adopted query suggestion techniques to help users formulate effective queries. In the specific domain of literature search (e.g., finding academic papers), the initial queries are usually based on a draft paper or abstract, rather than short lists of keywords. In this paper, we investigate phrasal-concept query suggestions for literature search. These suggestions explicitly specify important phrasal concepts related to an initial detailed query. The merits of phrasal-concept query suggestions for this domain are their readability and retrieval effectiveness: (1) phrasal concepts are natural for academic authors because of their frequent use of terminology and subject-specific phrases and (2) academic papers describe their key ideas via these subject-specific phrases, and thus phrasal concepts can be used effectively to find those papers. We propose a novel phrasal-concept query suggestion technique that generates queries by identifying key phrasal-concepts from pseudo-labeled documents and combines them with related phrases. Our proposed technique is evaluated in terms of both user preference and retrieval effectiveness. We conduct user experiments to verify a preference for our approach, in comparison to baseline query suggestion methods, and demonstrate the effectiveness of the technique with retrieval experiments.

8.

NTCIR-2 as a Rosetta stone in laboratory experiments of IR systems

《Information processing & management》2005,41(3):489-506

This paper presents a laboratory-based evaluation study of cross-language information retrieval technologies, utilizing partially parallel test collections, NTCIR-2 (used together with NTCIR-1), where Japanese–English parallel document collections, parallel topic sets and their relevance judgments are available. These enable us to observe and compare monolingual retrieval processes in two languages as well as retrieval across languages. Our experiments focused on (1) the Rosetta stone question (whether or not a partially parallel collection helps in cross-language information access) and (2) two aspects of retrieval difficulties, namely “collection discrepancy” and “query discrepancy”. Japanese and English monolingual retrieval systems are combined by dictionary-based query translation modules so that a symmetrical bilingual evaluation environment is implemented.

9.

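Using Google for topic-specific information retrieval (total citations: 5; self-citations: 0; citations by others: 5)

Lin Zhong 《情报科学》 (Information Science) 2003,21(11):1207-1209

Google is an excellent search engine with powerful functions and distinctive features. This paper studies the syntax rules of Google's basic and advanced search, and discusses strategies for using these rules to strengthen Google's keyword search, improve precision, construct search expressions correctly, and carry out topic-specific information retrieval.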

10.

Multimedia surrogates for video gisting: Toward combining spoken words and imagery

Gary Marchionini Yaxiao Song Robert Farrell 《Information processing & management》2009


11.

Compact group discovery in attributed graphs and social networks

《Information processing & management》2020,57(2):102054

Social networks and many other graphs are attributed, meaning that their nodes are labelled with textual information such as personal data, expertise or interests. In attributed graphs, a common data analysis task is to find subgraphs whose nodes contain a given set of keywords. In many applications, the size of the subgraph should be limited (i.e., a subgraph with thousands of nodes is not desired). In this work, we introduce the problem of compact attributed group (AG) discovery. Given a set of query keywords and a desired solution size, the task is to find subgraphs with the desired number of nodes, such that the nodes are closely connected and each node contains as many query keywords as possible. We prove that finding an optimal solution is NP-hard and we propose approximation algorithms with a guaranteed ratio of two. Since the number of qualifying AGs may be large, we also show how to find approximate top-k AGs with polynomial delay. Finally, we experimentally verify the effectiveness and efficiency of our techniques on real-world graphs.

12.

Practical and effective IR-style keyword search over semantic web

Xiaomin Ning Hai Jin Weijia Jia Pingpeng Yuan 《Information processing & management》2009

This paper presents a novel IR-style keyword search model for semantic web data retrieval, distinguished from current retrieval methods. In this model, an answer to a keyword query is a connected subgraph that contains all the query keywords. In addition, the answer is minimal because no proper subgraph can be an answer to the query. We provide an approximation algorithm to retrieve these answers efficiently. A special ranking strategy is also proposed so that answers can be appropriately ordered. The experimental results over real datasets show that our model outperforms existing possible solutions with respect to effectiveness and efficiency.

13.

Theory of functional sentence perspective and its application for the purposes of automatic extracting

Jiří Janoš 《Information processing & management》1979,15(1):19-25

When preparing more sophisticated methods of text analysis for information systems, an important role may belong to the applications of modified results gained on the basis of the theory of so-called “functional sentence perspective”, working with the concepts of the “theme” of a sentence (that which is spoken about in the sentence) and of the “rheme” (that which is said about the theme in the sentence). From the standpoint of the need to establish the “informational content” of the text, an analysis of this kind is undoubtedly more important than a traditional examination of subject-predicate relations etc. Along with a brief characterization of the results of this theory, the article analyzes the possibilities for the implementation of this theory in the theory of information systems, particularly with respect to the study of so-called thematic progressions in the text and the general structural formula of the text. Besides, particular attention is paid to the utilization of this theory in the domain of automatic extracting.

14.

An analysis of image retrieval tasks in the field of art history

《Information processing & management》2001,37(5):701-720


15.

User-responsive subject control in bibliographic retrieval systems

Jean M. Tague 《Information processing & management》1981,17(3):149-156


16.

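A query modification scheme for Web search

Yang Jianlin, Yan Ming 《情报理论与实践》 (Information Studies: Theory & Application) 2008,31(1):146-149

This paper analyzes modification methods and user information behavior in query modification, draws on the ideas of web page crawling and of giving equal weight to retrieval and browsing, and, taking into account both the behavioral characteristics of users during Web search and the available sources of vocabulary for query modification, presents a new query modification solution for Web search.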

17.

Automatic versus manual indexing

W.A. van der Meulen P.J.F.C. Janssen 《Information processing & management》1977,13(1):13-21

A comparative evaluation has been carried out on the Philips “DIRECT” and the British “INSPEC” retrieval system. DIRECT is based on automatic indexing whereas INSPEC uses manual subject indexing. Two queries were submitted to both systems, using the same data base. The results are expressed in terms of recall and precision. Both recall and precision of INSPEC were found to be higher than those of DIRECT by 20%. It is concluded that this is mainly a result of the query formulation. The effectiveness obtained with automatic indexing of documents is equivalent to that of the manual procedure.
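The comparison above is expressed in terms of recall and precision. As a minimal sketch of the standard set-based definitions (the function name and the use of document-id sets are assumptions for illustration; the evaluation procedure of the study itself is not described in the abstract):

```python
from typing import Hashable, Set, Tuple


def precision_recall(retrieved: Set[Hashable],
                     relevant: Set[Hashable]) -> Tuple[float, float]:
    """Precision = hits / retrieved, recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall
```

Running the same query against two systems over the same database and comparing the two resulting pairs is the kind of side-by-side measurement the abstract summarizes.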

18.

Geo-temporal distribution of tag terms for event-related image retrieval

Massimiliano Ruocco Heri Ramampiaro 《Information processing & management》2015

Media sharing applications, such as Flickr and Panoramio, contain a large number of pictures related to real life events. For this reason, the development of effective methods to retrieve these pictures is important, but still a challenging task. Recognizing this importance, and to improve the retrieval effectiveness of tag-based event retrieval systems, we propose a new method to extract a set of geographical tag features from raw geo-spatial profiles of user tags. The main idea is to use these features to select the best expansion terms in a machine learning-based query expansion approach. Specifically, we apply rigorous statistical exploratory analysis of spatial point patterns to extract the geo-spatial features. We use the features both to summarize the spatial characteristics of the spatial distribution of a single term, and to determine the similarity between the spatial profiles of two terms – i.e., term-to-term spatial similarity. To further improve our approach, we investigate the effect of combining our geo-spatial features with temporal features on choosing the expansion terms. To evaluate our method, we perform several experiments, including well-known feature analyses. Such analyses show how much our proposed geo-spatial features contribute to improving the overall retrieval performance. The results from our experiments demonstrate the effectiveness and viability of our method.

19.

Answering recreational web searches with relevant things to do results

《Information processing & management》2020,57(2):102184

Recreational queries from users searching for places to go and things to do or see are very common in web and mobile search. Users specify constraints for what they are looking for, like suitability for kids, romantic ambiance or budget. Queries like “restaurants in New York City” are currently served by static local results or the thumbnail carousel. More complex queries like “things to do in San Francisco with kids” or “romantic places to eat in Seattle” require the user to click on every element of the search engine result page to read articles from Yelp, TripAdvisor, or WikiTravel to satisfy their needs. Location data, which is an essential part of web search, is even more prevalent with location-based social networks and offers new opportunities for many ways of satisfying information seeking scenarios. In this paper, we address the problem of recreational queries in information retrieval and propose a solution that combines search query logs with LBSNs data to match user needs and possible options. At the core of our solution is a framework that combines social, geographical, and temporal information for a relevance model centered around the use of semantic annotations on Points of Interest with the goal of addressing these recreational queries. A central part of the framework is a taxonomy derived from behavioral data that drives the modeling and user experience. We also describe in detail the complexity of assessing and evaluating Point of Interest data, a topic that is usually not covered in related work, and propose task design alternatives that work well. We demonstrate the feasibility and scalability of our methods using a data set of 1B check-ins and a large sample of queries from the real-world. Finally, we describe the integration of our techniques in a commercial search engine.

20.

Effective sentence retrieval based on query-independent evidence

Ronald T. Fernández David E. Losada 《Information processing & management》2012

In this paper we propose an effective sentence retrieval method that consists of incorporating query-independent features into standard sentence retrieval models. To meet this aim, we apply a formal methodology and consider different query-independent features. In particular, we show that opinion-based features are promising. Opinion mining is an increasingly important research topic but little is known about how to improve retrieval algorithms with opinion-based components. In this respect, we consider here different kinds of opinion-based features to act as query-independent evidence and study whether this incorporation improves retrieval performance. On the other hand, information needs are usually related to people, locations or organizations. We hypothesize here that using these named entities as query-independent features may also improve the sentence relevance estimation. Finally, the length of the retrieval unit has been shown to be an important component in different retrieval scenarios. We therefore include length-based features in our study.