Multileveled ternary pattern and iterative ReliefF based bird sound classification,Applied Acoustics

当前位置： X-MOL 学术 › Appl. Acoust. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Multileveled ternary pattern and iterative ReliefF based bird sound classification
Applied Acoustics ( IF 3.4 ) Pub Date : 2021-01-02 , DOI: 10.1016/j.apacoust.2020.107866
Turker Tuncer , Erhan Akbal , Sengul Dogan

Birds may need to be identified for purposes such as environmental monitoring, follow-up, and species detection in the ecological area. Automatic sound classifiers have been used to perform species detection. Many methods have been presented in the literature to classify bird sounds with high accuracy. Nowadays, deep learning models have been used to classify data with high classification accuracy. However, these networks have high computational complexity. To obtain a highly accurate and lightweight classification model, a new multileveled and handcrafted features based machine learning model is presented. The presented automated bird sound classification model uses the multileveled ternary pattern (TP) feature generation, feature selection, and classification phases. The multileveled feature generation network can reach high classification accuracies since they generate high-level, low-level, and mid-level features. To construct levels, discrete wavelet transform (DWT) is employed to use the effectiveness of the DWT in bird sound classification. An improved version of the ReliefF, which is iterative ReliefF (IRF), is considered as feature selector. IRF selects the most informative features automatically, and these features are operated on linear discriminant (LD), k nearest neighbor (kNN), bagged tree (BT), and support vector machine (SVM) classifiers to calculate results of variable classifiers. The proposed multilevel TP and IRF based bird sound classification method reached 96.67% accuracy by using SVM on the 18 classes bird sound dataset.

中文翻译：

基于多级三元模式和迭代ReliefF的鸟声分类

出于环境监测，跟进和生态区域物种检测等目的，可能需要识别鸟类。自动声音分类器已用于执行物种检测。文献中已经提出了许多方法来对鸟的声音进行高精度分类。如今，深度学习模型已用于以高分类精度对数据进行分类。但是，这些网络具有很高的计算复杂度。为了获得高度准确和轻量级的分类模型，提出了一种新的基于多层和手工特征的机器学习模型。提出的自动鸟声分类模型使用了多级三元模式（TP）特征生成，特征选择和分类阶段。多层特征生成网络可以达到较高的分类精度，因为它们可以生成高级，低级和中级特征。为了构建级别，采用离散小波变换（DWT）来利用DWT在鸟类声音分类中的有效性。ReliefF的改进版本（迭代ReliefF（IRF））被视为功能选择器。IRF会自动选择信息量最大的功能，然后对线性判别（LD），k最近邻（kNN），袋装树（BT）和支持向量机（SVM）分类器进行操作，以计算变量分类器的结果。所提出的基于TP和IRF的多级鸟声音分类方法，通过在18类鸟声音数据集上使用支持向量机，达到了96.67％的准确性。和中级功能。为了构建级别，采用离散小波变换（DWT）来利用DWT在鸟类声音分类中的有效性。ReliefF的改进版本（迭代ReliefF（IRF））被视为功能选择器。IRF会自动选择信息量最大的功能，然后对线性判别（LD），k最近邻（kNN），袋装树（BT）和支持向量机（SVM）分类器进行操作，以计算变量分类器的结果。所提出的基于TP和IRF的多级鸟声音分类方法，通过在18类鸟声音数据集上使用支持向量机，达到了96.67％的准确性。和中级功能。为了构建级别，采用离散小波变换（DWT）来利用DWT在鸟类声音分类中的有效性。ReliefF的改进版本（迭代ReliefF（IRF））被视为功能选择器。IRF会自动选择信息量最大的功能，然后对线性判别（LD），k最近邻（kNN），袋装树（BT）和支持向量机（SVM）分类器进行操作，以计算变量分类器的结果。所提出的基于TP和IRF的多级鸟声音分类方法，通过在18类鸟声音数据集上使用支持向量机，达到了96.67％的准确性。迭代ReliefF（IRF）被视为功能选择器。IRF会自动选择信息量最大的功能，然后对线性判别（LD），k最近邻（kNN），袋装树（BT）和支持向量机（SVM）分类器进行操作，以计算变量分类器的结果。所提出的基于TP和IRF的多级鸟声音分类方法，通过在18类鸟声音数据集上使用支持向量机，达到了96.67％的准确性。迭代ReliefF（IRF）被视为功能选择器。IRF会自动选择信息量最大的功能，然后对线性判别（LD），k最近邻（kNN），袋装树（BT）和支持向量机（SVM）分类器进行操作，以计算变量分类器的结果。所提出的基于TP和IRF的多级鸟声音分类方法，通过在18类鸟声音数据集上使用支持向量机，达到了96.67％的准确性。

更新日期：2021-01-02

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11