Test Collection Based Evaluation of Information Retrieval Systems,Foundations and Trends in Information Retrieval

当前位置： X-MOL 学术 › Found. Trends Inf. Ret. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Test Collection Based Evaluation of Information Retrieval Systems
Foundations and Trends in Information Retrieval ( IF 10.4 ) Pub Date : 2010-6-21 , DOI: 10.1561/1500000009
Mark Sanderson

Use of test collections and evaluation measures to assess the effectiveness of information retrieval systems has its origins in work dating back to the early 1950s. Across the nearly 60 years since that work started, use of test collections is a de facto standard of evaluation. This monograph surveys the research conducted and explains the methods and measures devised for evaluation of retrieval systems, including a detailed look at the use of statistical significance testing in retrieval experimentation. This monograph reviews more recent examinations of the validity of the test collection approach and evaluation measures as well as outlining trends in current research exploiting query logs and live labs. At its core, the modern-day test collection is little different from the structures that the pioneering researchers in the 1950s and 1960s conceived of. This tutorial and review shows that despite its age, this long-standing evaluation method is still a highly valued tool for retrieval research.

中文翻译：

基于测试集合的信息检索系统评估

使用测试集和评估措施来评估信息检索系统的有效性可追溯到1950年代初。在这项工作开始以来的近60年中，使用测试集实际上是评估的标准。这本专论对进行的研究进行了调查，并解释了为评估检索系统而设计的方法和措施，包括详细介绍了统计意义检验在检索实验中的使用。这本专论回顾了有关测试收集方法和评估措施的有效性的最新检查，并概述了利用查询日志和实时实验室进行当前研究的趋势。本质上，现代的测试集与1950年代和1960年代的开创性研究人员所设想的结构几乎没有什么不同。本教程和评论表明，尽管年代久远，但这种长期的评估方法仍然是用于检索研究的非常有价值的工具。

更新日期：2010-06-21

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>