RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis,arXiv - CS - Sound

当前位置： X-MOL 学术 › arXiv.cs.SD › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis
arXiv - CS - Sound Pub Date : 2020-07-09 , DOI: arxiv-2007.04719
Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita

Environmental sound synthesis is a technique for generating a natural environmental sound. Conventional work on environmental sound synthesis using sound event labels cannot finely control synthesized sounds, for example, the pitch and timbre. We consider that onomatopoeic words can be used for environmental sound synthesis. Onomatopoeic words are effective for explaining the feature of sounds. We believe that using onomatopoeic words will enable us to control the fine time-frequency structure of synthesized sounds. However, there is no dataset available for environmental sound synthesis using onomatopoeic words. In this paper, we thus present RWCP-SSD-Onomatopoeia, a dataset consisting of 155,568 onomatopoeic words paired with audio samples for environmental sound synthesis. We also collected self-reported confidence scores and others-reported acceptance scores of onomatopoeic words, to help us investigate the difficulty in the transcription and selection of a suitable word for environmental sound synthesis.

中文翻译：

RWCP-SSD-Onomatopoeia：用于环境声音合成的拟声词数据集

环境声音合成是一种产生自然环境声音的技术。使用声音事件标签进行环境声音合成的传统工作无法精细地控制合成声音，例如音高和音色。我们认为拟声词可用于环境音合成。拟声词对于解释声音的特征是有效的。我们相信使用拟声词将使我们能够控制合成声音的精细时频结构。但是，没有可用于使用拟声词合成环境声音的数据集。因此，在本文中，我们提出了 RWCP-SSD-Onomatopoeia，这是一个由 155,568 个拟声词与音频样本组成的数据集，用于环境声音合成。

更新日期：2020-07-10

点击分享查看原文

点击收藏

阅读更多本刊最新论文