Survey on evaluation methods for dialogue systems, Artificial Intelligence Review



Artificial Intelligence Review (IF 10.7), Pub Date: 2020-06-25, DOI: 10.1007/s10462-020-09866-x
Jan Deriu, Alvaro Rodrigo, Arantxa Otegi, Guillermo Echegoyen, Sophie Rosset, Eneko Agirre, Mark Cieliebak

In this paper, we survey the methods and concepts developed for the evaluation of dialogue systems. Evaluation is a crucial part of the development process. Dialogue systems are often evaluated by means of human evaluations and questionnaires, but this tends to be very cost- and time-intensive. Much work has therefore gone into finding methods that reduce the involvement of human labour. In this survey, we present the main concepts and methods. To do so, we differentiate between the various classes of dialogue systems (task-oriented, conversational, and question-answering dialogue systems). We cover each class by introducing the main technologies developed for its dialogue systems and then presenting the evaluation methods for that class.


Updated: 2020-06-25
