Deep learning-based automatic downbeat tracking: a brief review,Multimedia Systems

当前位置： X-MOL 学术 › Multimedia Syst. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Deep learning-based automatic downbeat tracking: a brief review
Multimedia Systems ( IF 3.5 ) Pub Date : 2019-03-12 , DOI: 10.1007/s00530-019-00607-x
Bijue Jia , Jiancheng Lv , Dayiheng Liu

As an important format of multimedia, music has filled almost everyone’s life. Automatic analyzing of music is a significant step to satisfy people’s need for music retrieval and music recommendation in an effortless way. Thereinto, downbeat tracking has been a fundamental and continuous problem in Music Information Retrieval (MIR) area. Despite significant research efforts, downbeat tracking still remains a challenge. Previous researches either focus on feature engineering (extracting certain features by signal processing, which are semi-automatic solutions); or have some limitations: they can only model music audio recordings within limited time signatures and tempo ranges. Recently, deep learning has surpassed traditional machine learning methods and has become the primary algorithm in feature learning; the combination of traditional and deep learning methods also has made better performance. In this paper, we begin with a background introduction of downbeat tracking problem. Then, we give detailed discussions of the following topics: system architecture, feature extraction, deep neural network algorithms, data sets, and evaluation strategy. In addition, we take a look at the results from the annual benchmark evaluation—Music Information Retrieval Evaluation eXchange—as well as the developments in software implementations. Although much has been achieved in the area of automatic downbeat tracking, some problems still remain. We point out these problems and conclude with possible directions and challenges for future research.

中文翻译：

基于深度学习的自动悲观跟踪：简要回顾

音乐作为一种重要的多媒体形式，几乎充斥了每个人的生活。音乐的自动分析是轻松满足人们对音乐检索和音乐推荐需求的重要一步。其中，强拍跟踪一直是音乐信息检索（MIR）领域的一个基本且持续存在的问题。尽管进行了大量研究，但悲观追踪仍然是一个挑战。以往的研究要么侧重于特征工程（通过信号处理提取某些特征，属于半自动解决方案）；或者有一些限制：他们只能在有限的拍号和速度范围内对音乐录音进行建模。最近，深度学习已经超越了传统的机器学习方法，成为特征学习的主要算法；传统和深度学习方法的结合也取得了更好的表现。在本文中，我们首先介绍强弱跟踪问题的背景。然后，我们详细讨论了以下主题：系统架构、特征提取、深度神经网络算法、数据集和评估策略。此外，我们还查看了年度基准评估的结果——音乐信息检索评估交换——以及软件实现的发展。尽管在自动强拍跟踪领域已经取得了很大进展，但仍然存在一些问题。我们指出了这些问题，并总结了未来研究可能的方向和挑战。我们首先介绍悲观跟踪问题的背景。然后，我们详细讨论了以下主题：系统架构、特征提取、深度神经网络算法、数据集和评估策略。此外，我们还查看了年度基准评估的结果——音乐信息检索评估交换——以及软件实现的发展。尽管在自动强拍跟踪领域已经取得了很大进展，但仍然存在一些问题。我们指出了这些问题，并总结了未来研究可能的方向和挑战。我们首先介绍悲观跟踪问题的背景。然后，我们详细讨论了以下主题：系统架构、特征提取、深度神经网络算法、数据集和评估策略。此外，我们还查看了年度基准评估的结果——音乐信息检索评估交换——以及软件实现的发展。尽管在自动强拍跟踪领域已经取得了很大进展，但仍然存在一些问题。我们指出了这些问题，并总结了未来研究可能的方向和挑战。我们来看看年度基准评估的结果——音乐信息检索评估交换——以及软件实现的发展。尽管在自动强拍跟踪领域已经取得了很大进展，但仍然存在一些问题。我们指出了这些问题，并总结了未来研究可能的方向和挑战。我们来看看年度基准评估的结果——音乐信息检索评估交换——以及软件实现的发展。尽管在自动强拍跟踪领域已经取得了很大进展，但仍然存在一些问题。我们指出了这些问题，并总结了未来研究可能的方向和挑战。

更新日期：2019-03-12

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11