当前位置: X-MOL 学术arXiv.cs.RO › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Intelligent Conversational Android ERICA Applied to Attentive Listening and Job Interview
arXiv - CS - Robotics Pub Date : 2021-05-02 , DOI: arxiv-2105.00403
Tatsuya Kawahara, Koji Inoue, Divesh Lala

Following the success of spoken dialogue systems (SDS) in smartphone assistants and smart speakers, a number of communicative robots are developed and commercialized. Compared with the conventional SDSs designed as a human-machine interface, interaction with robots is expected to be in a closer manner to talking to a human because of the anthropomorphism and physical presence. The goal or task of dialogue may not be information retrieval, but the conversation itself. In order to realize human-level "long and deep" conversation, we have developed an intelligent conversational android ERICA. We set up several social interaction tasks for ERICA, including attentive listening, job interview, and speed dating. To allow for spontaneous, incremental multiple utterances, a robust turn-taking model is implemented based on TRP (transition-relevance place) prediction, and a variety of backchannels are generated based on time frame-wise prediction instead of IPU-based prediction. We have realized an open-domain attentive listening system with partial repeats and elaborating questions on focus words as well as assessment responses. It has been evaluated with 40 senior people, engaged in conversation of 5-7 minutes without a conversation breakdown. It was also compared against the WOZ setting. We have also realized a job interview system with a set of base questions followed by dynamic generation of elaborating questions. It has also been evaluated with student subjects, showing promising results.

中文翻译:

智能对话Android ERICA适用于专心聆听和求职面试

继智能手机助手和智能扬声器中的口语对话系统(SDS)成功之后,许多通信机器人得以开发和商业化。与被设计为人机界面的传统SDS相比,由于拟人化和身体存在,与机器人的交互被期望与人类对话更加接近。对话的目的或任务可能不是信息检索,而是对话本身。为了实现人类级别的“长而深”的对话,我们开发了一个智能对话式Android ERICA。我们为ERICA设置了多项社交互动任务,包括专心聆听,工作面试和快速约会。为了允许自发的,增量的多次发声,基于TRP(过渡相关地点)预测实现了鲁棒的转弯模型,并且基于时间框架预测而不是基于IPU的预测生成了各种反向信道。我们已经实现了一个开放域的专心聆听系统,该系统具有部分重复和针对重点词以及评估答案的详细说明。已对40位资深人士进行了评估,他们进行了5到7分钟的对话,而对话没有中断。还与WOZ设置进行了比较。我们还实现了具有一组基本问题,然后动态生成详尽问题的工作面试系统。它也已与学生科目进行了评估,显示出令人鼓舞的结果。并且基于时帧预测而不是基于IPU的预测会生成各种反向信道。我们已经实现了一个开放域的专心聆听系统,该系统具有部分重复和针对重点词以及评估答案的详细说明。已对40位资深人士进行了评估,他们进行了5到7分钟的对话,而对话没有中断。还与WOZ设置进行了比较。我们还实现了具有一组基本问题,然后动态生成详尽问题的工作面试系统。它也已与学生科目进行了评估,显示出令人鼓舞的结果。并且基于时帧预测而不是基于IPU的预测会生成各种反向信道。我们已经实现了一个开放域的专心聆听系统,该系统具有部分重复和针对重点词以及评估答案的详细说明。已对40位资深人士进行了评估,他们进行了5到7分钟的对话,而对话没有中断。还与WOZ设置进行了比较。我们还实现了具有一组基本问题,然后动态生成详尽问题的工作面试系统。它也已与学生科目进行了评估,显示出令人鼓舞的结果。进行了5到7分钟的对话,没有对话中断。还与WOZ设置进行了比较。我们还实现了具有一组基本问题,然后动态生成详尽问题的工作面试系统。它也已与学生科目进行了评估,显示出令人鼓舞的结果。进行了5到7分钟的对话,没有对话中断。还与WOZ设置进行了比较。我们还实现了具有一组基本问题,然后动态生成详尽问题的工作面试系统。它也已与学生科目进行了评估,显示出令人鼓舞的结果。
更新日期:2021-05-04
down
wechat
bug