当前位置: X-MOL 学术EURASIP J. Audio Speech Music Proc. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Robust singer identification of Indian playback singers
EURASIP Journal on Audio, Speech, and Music Processing ( IF 2.4 ) Pub Date : 2019-06-17 , DOI: 10.1186/s13636-019-0153-0
Deepali Y. Loni , Shaila Subbaraman

Singing voice analysis has been a topic of research to assist several applications in the domain of music information retrieval system. One such major area is singer identification (SID). There has been enormous increase in production of movies and songs in Bollywood industry over the last 50 decades. Surveying this extensive dataset of singers, the paper presents singer identification system for Indian playback singers. Four acoustic features namely—formants, harmonic spectral envelope, vibrato, and timbre—that uniquely describe the singer are extracted from the singing voice segments. Using the combination of these multiple acoustic features, we address the major challenges in SID like the variations in singer’s voice, testing of multilingual songs, and the album effect. Systematic evaluation shows the SID is robust against the variations in singer’s singing style and structure of songs and is effective in identifying the cover songs and singers. The results are investigated on in-house cappella database consisting of 26 singers and 550 songs. By performing dimension reduction of the feature vector and using Support Vector Machine classifier, we achieved an accuracy of 86% using fourfold cross validation process. In addition, performance comparison of the proposed work with other existing approaches reveals the superiority in terms of volume of dataset and song duration.

中文翻译:

印度播放歌手的强大歌手识别

歌声分析一直是一个研究课题,以协助音乐信息检索系统领域的多个应用。其中一个主要领域是歌手识别 (SID)。在过去的 50 年里,宝莱坞行业的电影和歌曲制作有了巨大的增长。调查这个广泛的歌手数据集,本文提出了印度播放歌手的歌手识别系统。四个声学特征,即共振峰、谐波频谱包络、颤音和音色——从歌声片段中提取出唯一地描述歌手的声学特征。使用这些多种声学特征的组合,我们解决了 SID 中的主要挑战,如歌手声音的变化、多语言歌曲的测试和专辑效果。系统评估表明,SID对歌手演唱风格和歌曲结构的变化具有鲁棒性,对识别翻唱歌曲和歌手有效。结果在由 26 位歌手和 550 首歌曲组成的内部无伴奏合唱数据库中进行了调查。通过对特征向量进行降维并使用支持向量机分类器,我们使用四重交叉验证过程实现了 86% 的准确率。此外,所提出的工作与其他现有方法的性能比较揭示了数据集数量和歌曲持续时间方面的优势。通过对特征向量进行降维并使用支持向量机分类器,我们使用四重交叉验证过程实现了 86% 的准确率。此外,所提出的工作与其他现有方法的性能比较揭示了数据集数量和歌曲持续时间方面的优势。通过对特征向量进行降维并使用支持向量机分类器,我们使用四重交叉验证过程实现了 86% 的准确率。此外,所提出的工作与其他现有方法的性能比较揭示了数据集数量和歌曲持续时间方面的优势。
更新日期:2019-06-17
down
wechat
bug