Voice in Human–Agent Interaction
ACM Computing Surveys (IF 16.6), Pub Date: 2021-05-04, DOI: 10.1145/3386867
Katie Seaborn 1, Norihisa P. Miyake 2, Peter Pennefather 3, Mihoko Otake-Matsuura 2

Social robots, conversational agents, voice assistants, and other embodied AI are increasingly a feature of everyday life. What connects these various types of intelligent agents is their ability to interact with people through voice. Voice is becoming an essential modality of embodiment, communication, and interaction between computer-based agents and end-users. This survey presents a meta-synthesis on agent voice in the design and experience of agents from a human-centered perspective: voice-based human–agent interaction (vHAI). Findings emphasize the social role of voice in HAI and circumscribe a relationship between agent voice and body, corresponding to human models of social psychology and cognition. Additionally, changes in perceptions of and reactions to agent voice over time reveal a generational shift coinciding with the commercial proliferation of mobile voice assistants. The main contributions of this work are a vHAI classification framework for voice across various agent forms, contexts, and user groups; a critical analysis grounded in key theories; and an identification of future directions for the oncoming wave of vocal machines.

Updated: 2021-05-04