当前期刊: Information Retrieval Journal Go to current issue    加入关注   
显示样式:        排序: IF: - GO 导出
  • An axiomatic approach to corpus-based cross-language information retrieval
    Inf. Retrieval J. (IF 2.535) Pub Date : 2020-04-09
    Razieh Rahimi, Ali Montazeralghaem, Azadeh Shakery

    Abstract A major challenge in cross-language information retrieval (CLIR) is the adoption of translation knowledge in retrieval models, as it affects term weighting which is known to highly impact the retrieval performance. Despite its importance, how different approaches for integration of translation knowledge into retrieval models relatively perform has not been analytically examined. In this paper

  • Deep cross-platform product matching in e-commerce
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-08-13
    Juan Li, Zhicheng Dou, Yutao Zhu, Xiaochen Zuo, Ji-Rong Wen

    Abstract Online shopping has become more and more popular in recent years, which leads to a prosperity on online platforms. Generally, the identical products are provided by many sellers on multiple platforms. Thus the comparison between products on multiple platforms becomes a basic demand for both consumers and sellers. However, identifying identical products on multiple platforms is difficult because

  • Offline evaluation options for recommender systems
    Inf. Retrieval J. (IF 2.535) Pub Date : 2020-03-18
    Rocío Cañamares, Pablo Castells, Alistair Moffat

    Abstract We undertake a detailed examination of the steps that make up offline experiments for recommender system evaluation, including the manner in which the available ratings are filtered and split into training and test; the selection of a subset of the available users for the evaluation; the choice of strategy to handle the background effects that arise when the system is unable to provide scores

  • A passage-based approach to learning to rank documents
    Inf. Retrieval J. (IF 2.535) Pub Date : 2020-03-06
    Eilon Sheetrit, Anna Shtok, Oren Kurland

    Abstract According to common relevance-judgments regimes, such as TREC’s, a document can be deemed relevant to a query even if it contains a very short passage of text with pertinent information. This fact has motivated work on passage-based document retrieval: document ranking methods that induce information from the document’s passages. However, the main source of passage-based information utilized

  • Low-cost, bottom-up measures for evaluating search result diversification
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-04-20
    Zhicheng Dou, Xue Yang, Diya Li, Ji-Rong Wen, Tetsuya Sakai

    Abstract Search result diversification aims at covering different user intents by returning a diversified document list. Most existing diversity measures require a predefined set of intents for a given query, where it is assumed that there is no relationship across these intents. However, studies have shown that modeling a hierarchy of intents has some benefits over the standard measure of using a

  • Evaluating sentence-level relevance feedback for high-recall information retrieval
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-08-13
    Haotian Zhang, Gordon V. Cormack, Maura R. Grossman, Mark D. Smucker

    Abstract This study uses a novel simulation framework to evaluate whether the time and effort necessary to achieve high recall using active learning is reduced by presenting the reviewer with isolated sentences, as opposed to full documents, for relevance feedback. Under the weak assumption that more time and effort is required to review an entire document than a single sentence, simulation results

  • Fewer topics? A million topics? Both?! On topics subsets in test collections
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-05-08
    Kevin Roitero, J. Shane Culpepper, Mark Sanderson, Falk Scholer, Stefano Mizzaro

    Abstract When evaluating IR run effectiveness using a test collection, a key question is: What search topics should be used? We explore what happens to measurement accuracy when the number of topics in a test collection is reduced, using the Million Query 2007, TeraByte 2006, and Robust 2004 TREC collections, which all feature more than 50 topics, something that has not been examined in past work.

  • ReBoost: a retrieval-boosted sequence-to-sequence model for neural response generation
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-09-23
    Yutao Zhu, Zhicheng Dou, Jian-Yun Nie, Ji-Rong Wen

    Abstract Human–computer conversation is an active research topic in natural language processing. One of the representative methods to build conversation systems uses the sequence-to-sequence (Seq2seq) model through neural networks. However, with limited input information, the Seq2seq model tends to generate meaningless and trivial responses. It can be greatly enhanced if more supplementary information

  • Demographic differences in search engine use with implications for cohort selection
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-01-01
    Elad Yom-Tov

    Abstract The correlation between the demographics of users and the text they write has been investigated through literary texts and, more recently, social media. However, differences pertaining to language use in search engines has not been thoroughly analyzed, especially for age and gender differences. Such differences are important especially due to the growing use of search engine data in the study

  • Beyond word embeddings: learning entity and concept representations from large scale knowledge bases
    Inf. Retrieval J. (IF 2.535) Pub Date : 2018-08-11
    Walid Shalaby, Wlodek Zadrozny, Hongxia Jin

    Abstract Text representations using neural word embeddings have proven effective in many NLP applications. Recent researches adapt the traditional word embedding models to learn vectors of multiword expressions (concepts/entities). However, these methods are limited to textual knowledge bases (e.g., Wikipedia). In this paper, we propose a novel and simple technique for integrating the knowledge about

  • A comparison of filtering evaluation metrics based on formal constraints
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-04-01
    Enrique Amigó, Julio Gonzalo, Felisa Verdejo, Damiano Spina

    Abstract Although document filtering is simple to define, there is a wide range of different evaluation measures that have been proposed in the literature, all of which have been subject to criticism. Our goal is to compare metrics from a formal point of view, in order to understand whether each metric is appropriate, why and when, in order to achieve a better understanding of the similarities and

  • A selective approach to index term weighting for robust information retrieval based on the frequency distributions of query terms
    Inf. Retrieval J. (IF 2.535) Pub Date : 2018-12-13
    Ahmet Arslan, Bekir Taner Dinçer

    Abstract A typical information retrieval (IR) system applies a single retrieval strategy to every information need of users. However, the results of the past IR experiments show that a particular retrieval strategy is in general good at fulfilling some type of information needs while failing to fulfil some other type, i.e., high variation in retrieval effectiveness across information needs. On the

  • Informational, transactional, and navigational need of information: relevance of search intention in search engine advertising
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-11-26
    Carsten D. Schultz

    Abstract This study investigates the impact of search query intention when evaluating and managing search engine advertising. Specifically, we study whether the performance of a search engine advertising campaign depends on the informational, transactional, and navigational search intentions and also consider the appearance of an organic result alongside a search engine advertisement on the same search

  • Preference-based interactive multi-document summarisation
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-11-19
    Yang Gao, Christian M. Meyer, Iryna Gurevych

    Abstract Interactive NLP is a promising paradigm to close the gap between automatic NLP systems and the human upper bound. Preference-based interactive learning has been successfully applied, but the existing methods require several thousand interaction rounds even in simulations with perfect user feedback. In this paper, we study preference-based interactive summarisation. To reduce the number of

  • Boosting learning to rank with user dynamics and continuation methods
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-11-05
    Nicola Ferro, Claudio Lucchese, Maria Maistro, Raffaele Perego

    Abstract Learning to rank (LtR) techniques leverage assessed samples of query-document relevance to learn effective ranking functions able to exploit the noisy signals hidden in the features used to represent queries and documents. In this paper we explore how to enhance the state-of-the-art LambdaMart LtR algorithm by integrating in the training process an explicit knowledge of the underlying user-interaction

  • Optimizing the recency-relevance-diversity trade-offs in non-personalized news recommendations
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-02-27
    Abhijnan Chakraborty, Saptarshi Ghosh, Niloy Ganguly, Krishna P. Gummadi

    Abstract Online news media sites are emerging as the primary source of news for a large number of users. Due to a large number of stories being published in these media sites, users usually rely on news recommendation systems to find important news. In this work, we focus on automatically recommending news stories to all users of such media websites, where the selection is not influenced by a particular

  • On the impact of group size on collaborative search effectiveness
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-01-09
    Felipe Moraes, Kilian Grashoff, Claudia Hauff

    Abstract While today’s web search engines are designed for single-user search, over the years research efforts have shown that complex information needs—which are explorative, open-ended and multi-faceted—can be answered more efficiently and effectively when searching in collaboration. Collaborative search (and sensemaking) research has investigated techniques, algorithms and interface affordances

  • Abstraction of query auto completion logs for anonymity-preserving analysis
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-06-06
    Unni Krishnan, Bodo Billerbeck, Alistair Moffat, Justin Zobel

    Abstract Query auto completion (QAC) is used in search interfaces to interactively offer a list of suggestions to users as they enter queries. The suggested completions are updated each time the user modifies their partial query, as they either add further keystrokes or interact directly with completions that have been offered. In this work we use a state model to capture the possible interactions

  • The impact of result diversification on search behaviour and performance
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-05-16
    David Maxwell, Leif Azzopardi, Yashar Moshfeghi

    Abstract Result diversification aims to provide searchers with a broader view of a given topic while attempting to maximise the chances of retrieving relevant material. Diversifying results also aims to reduce search bias by increasing the coverage over different aspects of the topic. As such, searchers should learn more about the given topic in general. Despite diversification algorithms being introduced

  • Evaluation measures for quantification: an axiomatic approach
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-09-21
    Fabrizio Sebastiani

    Abstract Quantification is the task of estimating, given a set \(\sigma \) of unlabelled items and a set of classes \({\mathcal {C}}=\{c_{1}, \ldots , c_{|{\mathcal {C}}|}\}\) , the prevalence (or “relative frequency”) in \(\sigma \) of each class \(c_{i}\in {\mathcal {C}}\) . While quantification may in principle be solved by classifying each item in \(\sigma \) and counting how many such items have

  • How do interval scales help us with better understanding IR evaluation measures?
    Inf. Retrieval J. (IF 2.535) Pub Date : 2019-09-04
    Marco Ferrante, Nicola Ferro, Eleonora Losiouk

    Abstract Evaluation measures are the basis for quantifying the performance of IR systems and the way in which their values can be processed to perform statistical analyses depends on the scales on which these measures are defined. For example, mean and variance should be computed only when relying on interval scales. In our previous work we defined a theory of IR evaluation measures, based on the representational

Contents have been reproduced by permission of the publishers.
全球疫情及响应:BMC Medicine专题征稿