当前位置:
X-MOL 学术
›
arXiv.cs.SD
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis
arXiv - CS - Sound Pub Date : 2020-07-09 , DOI: arxiv-2007.04719 Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita
arXiv - CS - Sound Pub Date : 2020-07-09 , DOI: arxiv-2007.04719 Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, Yoichi Yamashita
Environmental sound synthesis is a technique for generating a natural
environmental sound. Conventional work on environmental sound synthesis using
sound event labels cannot finely control synthesized sounds, for example, the
pitch and timbre. We consider that onomatopoeic words can be used for
environmental sound synthesis. Onomatopoeic words are effective for explaining
the feature of sounds. We believe that using onomatopoeic words will enable us
to control the fine time-frequency structure of synthesized sounds. However,
there is no dataset available for environmental sound synthesis using
onomatopoeic words. In this paper, we thus present RWCP-SSD-Onomatopoeia, a
dataset consisting of 155,568 onomatopoeic words paired with audio samples for
environmental sound synthesis. We also collected self-reported confidence
scores and others-reported acceptance scores of onomatopoeic words, to help us
investigate the difficulty in the transcription and selection of a suitable
word for environmental sound synthesis.
中文翻译:
RWCP-SSD-Onomatopoeia:用于环境声音合成的拟声词数据集
环境声音合成是一种产生自然环境声音的技术。使用声音事件标签进行环境声音合成的传统工作无法精细地控制合成声音,例如音高和音色。我们认为拟声词可用于环境音合成。拟声词对于解释声音的特征是有效的。我们相信使用拟声词将使我们能够控制合成声音的精细时频结构。但是,没有可用于使用拟声词合成环境声音的数据集。因此,在本文中,我们提出了 RWCP-SSD-Onomatopoeia,这是一个由 155,568 个拟声词与音频样本组成的数据集,用于环境声音合成。
更新日期:2020-07-10
中文翻译:
RWCP-SSD-Onomatopoeia:用于环境声音合成的拟声词数据集
环境声音合成是一种产生自然环境声音的技术。使用声音事件标签进行环境声音合成的传统工作无法精细地控制合成声音,例如音高和音色。我们认为拟声词可用于环境音合成。拟声词对于解释声音的特征是有效的。我们相信使用拟声词将使我们能够控制合成声音的精细时频结构。但是,没有可用于使用拟声词合成环境声音的数据集。因此,在本文中,我们提出了 RWCP-SSD-Onomatopoeia,这是一个由 155,568 个拟声词与音频样本组成的数据集,用于环境声音合成。