当前位置: X-MOL 学术arXiv.cs.SD › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios
arXiv - CS - Sound Pub Date : 2021-05-06 , DOI: arxiv-2105.02592
Jakob Abeßer

This paper introduces a novel dataset for polyphonic sound event detection in urban sound monitoring use-cases. Based on isolated sounds taken from the FSD50k dataset, 20,000 polyphonic soundscapes are synthesized with sounds being randomly positioned in the stereo panorama using different loudness levels. The paper gives a detailed discussion of possible application scenarios, explains the dataset generation process in detail, and discusses current limitations of the proposed USM-SED dataset.

中文翻译:

USM-SED-用于城市声音监视场景中的复音声音事件检测的数据集

本文介绍了用于城市声音监控用例中的复音声音事件检测的新数据集。根据从FSD50k数据集中获取的隔离声音,合成了20,000个多音声景,并使用不同的响度级别将声音随机放置在立体声全景中。本文详细讨论了可能的应用场景,详细说明了数据集生成过程,并讨论了所提出的USM-SED数据集的当前局限性。
更新日期:2021-05-07
down
wechat
bug