当前位置: X-MOL 学术Comput. Graph. Forum › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Statistics‐based Motion Synthesis for Social Conversations
Computer Graphics Forum ( IF 2.5 ) Pub Date : 2020-11-24 , DOI: 10.1111/cgf.14114
Yanzhe Yang 1 , Jimei Yang 2 , Jessica Hodgins 1
Affiliation  

Plausible conversations among characters are required to generate the ambiance of social settings such as a restaurant, hotel lobby, or cocktail party. In this paper, we propose a motion synthesis technique that can rapidly generate animated motion for characters engaged in two‐party conversations. Our system synthesizes gestures and other body motions for dyadic conversations that synchronize with novel input audio clips. Human conversations feature many different forms of coordination and synchronization. For example, speakers use hand gestures to emphasize important points, and listeners often nod in agreement or acknowledgment. To achieve the desired degree of realism, our method first constructs a motion graph that preserves the statistics of a database of recorded conversations performed by a pair of actors. This graph is then used to search for a motion sequence that respects three forms of audio‐motion coordination in human conversations: coordination to phonemic clause, listener response, and partner's hesitation pause. We assess the quality of the generated animations through a user study that compares them to the originally recorded motion and evaluate the effects of each type of audio‐motion coordination via ablation studies.

中文翻译:

用于社交对话的基于统计的运动合成

需要角色之间合理的对话来营造社交环境的氛围,例如餐厅、酒店大堂或鸡尾酒会。在本文中,我们提出了一种动作合成技术,可以为参与两方对话的角色快速生成动画动作。我们的系统为与新的输入音频剪辑同步的二元对话合成手势和其他身体动作。人类对话具有许多不同形式的协调和同步。例如,演讲者使用手势来强调重点,而听众通常会点头表示同意或承认。为了达到理想的真实度,我们的方法首先构建了一个运动图,该图保留了由一对演员进行的记录对话的数据库的统计数据。然后使用该图来搜索符合人类对话中三种形式的音频-运动协调的运动序列:与音素从句的协调、听者的反应和伙伴的犹豫停顿。我们通过用户研究评估生成的动画的质量,将它们与原始记录的运动进行比较,并通过消融研究评估每种类型的音频-运动协调的效果。
更新日期:2020-11-24
down
wechat
bug