当前位置: X-MOL 学术Ecol. Inform. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Soundscape segregation based on visual analysis and discriminating features
Ecological Informatics ( IF 5.8 ) Pub Date : 2020-11-04 , DOI: 10.1016/j.ecoinf.2020.101184
Fábio Felix Dias , Helio Pedrini , Rosane Minghim

The distinction of landscapes based on their sound patterns is useful for several analyses. For instance, comparisons of audio files from different periods enable the detection of changes over time in a particular habitat, signaling events of importance, such as modifications in the balance between species and presence of new ones. The handling of a large number of different sound recordings in wild environments also reduces the set of sounds to be examined. However, the current efforts towards soundscape interpretation do not provide enough elements for researchers to automatically split soundscape datasets with degrees of similarity, thus requiring users' feedback for the grouping of highly related recordings. This work introduces a strategy for the exploration and analysis of soundscapes that highlights data characteristics related to differences and similarities among distinct soundscapes. It is based on a visual and numerical evaluation of feature spaces and was applied to three feature sets, namely acoustic indices and measurements, images from audio spectrograms depicted by classic features, and the same images depicted by features automatically generated by Deep Learning techniques. The results indicate that certain combinations of acoustic indices and measurements perform well for the discrimination task, although other feature sets have not been discarded. In addition, visual techniques were able to assist this type of analysis.



中文翻译:

基于视觉分析和区分特征的音景隔离

基于声音模式对景观的区分对于几种分析很有用。例如,通过比较不同时期的音频文件,可以检测特定栖息地随时间的变化,从而发出重要事件的信号,例如改变物种之间的平衡以及存在新物种。在野外环境中处理大量不同的录音也会减少要检查的声音集。但是,当前对音景解释的努力并未为研究人员提供足够的元素来自动分割具有相似度的音景数据集,因此需要用户反馈以对高度相关的录音进行分组。这项工作介绍了一种探索和分析音景的策略,该策略突出显示了与不同音景之间的差异和相似性相关的数据特征。它基于对特征空间的视觉和数字评估,并应用于三个特征集,即声学指数和测量值,经典特征所描绘的音频频谱图图像以及深度学习技术自动生成的特征所描绘的相同图像。结果表明,尽管没有丢弃其他特征集,但某些声学指数和测量值的组合对于区分任务表现良好。另外,视觉技术能够协助这种类型的分析。它基于对特征空间的视觉和数字评估,并应用于三个特征集,即声学指数和测量值,经典特征所描绘的音频频谱图图像以及深度学习技术自动生成的特征所描绘的相同图像。结果表明,尽管没有丢弃其他特征集,但某些声学指数和测量值的组合对于区分任务表现良好。另外,视觉技术能够协助这种类型的分析。它基于对特征空间的视觉和数字评估,并应用于三个特征集,即声学指数和测量值,经典特征所描绘的音频频谱图图像以及深度学习技术自动生成的特征所描绘的相同图像。结果表明,尽管没有丢弃其他特征集,但某些声学指数和测量值的组合对于区分任务表现良好。另外,视觉技术能够协助这种类型的分析。结果表明,尽管没有丢弃其他特征集,但某些声学指数和测量值的组合对于区分任务表现良好。另外,视觉技术能够协助这种类型的分析。结果表明,尽管没有丢弃其他特征集,但某些声学指数和测量值的组合对于区分任务表现良好。另外,视觉技术能够协助这种类型的分析。

更新日期:2020-11-12
down
wechat
bug