Defining laughter context for laughter synthesis with spontaneous speech corpus,IEEE Transactions on Affective Computing

当前位置： X-MOL 学术 › IEEE Trans. Affect. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Defining laughter context for laughter synthesis with spontaneous speech corpus
IEEE Transactions on Affective Computing ( IF 9.6 ) Pub Date : 2020-07-01 , DOI: 10.1109/taffc.2018.2813381
Tomohiro Nagata , Hiroki Mori

In this paper, conversational laughter was synthesized by a statistical model-based speech synthesis framework using spontaneous speech corpora. The phonetic transcriptions of natural laughter in these corpora were annotated, and the context required to synthesize the laughter that accompanies speech sounds was defined from the perspective of the (1) phonetic properties of the current segment, (2) phonetic properties of previous and succeeding segments, and (3) positional factors of the current segment or laughter bout. Laughter was synthesized using the defined context and the framework of HMM-based speech synthesis. To confirm the influence of the contextual factors on the naturalness of speech, a subjective evaluation was performed. As the result of the evaluation, the naturalness of the entire utterance was improved by using the contextual factors defined in this study. This result confirmed the importance of defining the appropriate context to synthesize natural conversational laughter.

中文翻译：

用自发语音语料定义笑声合成的笑声上下文

在本文中，对话笑声是通过使用自发语音语料库的基于统计模型的语音合成框架合成的。对这些语料库中自然笑声的音标进行了注释，并从（1）当前句段的语音特性，（2）前后的语音特性的角度定义了合成伴随语音的笑声所需的上下文。段，以及（3）当前段或笑声的位置因素。笑声是使用定义的上下文和基于 HMM 的语音合成框架合成的。为了确认语境因素对语音自然度的影响，进行了主观评价。根据评估结果，通过使用本研究中定义的上下文因素，整个话语的自然度得到了改善。这一结果证实了定义适当上下文以合成自然对话笑声的重要性。

更新日期：2020-07-01

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11