论文标题

探测场景分类的深度神经决策森林

Deep Neural Decision Forest for Acoustic Scene Classification

论文作者

Sun, Jianyuan, Liu, Xubo, Mei, Xinhao, Zhao, Jinzheng, Plumbley, Mark D., Kılıç, Volkan, Wang, Wenwu

论文摘要

声学场景分类(ASC)旨在根据记录环境的特征对音频剪辑进行分类。在这方面,基于深度学习的方法已成为ASC问题的有用工具。提高分类精度的常规方法包括整合辅助方法,例如注意机制,预训练模型和集合多个子网络。但是,由于从不同环境中捕获的音频剪辑的复杂性,因此很难在不使用任何仅使用单个分类器的现有深度学习模型的情况下使用任何辅助方法来区分其类别。在本文中,我们提出了一种使用深神经决策森林(DNDF)的新颖方法。 DNDF结合了固定数量的卷积层和决策林作为最终分类器。决策森林由固定数量的决策树分类器组成,这些分类器已被证明比某些数据集中的单个分类器提供了更好的分类性能。尤其是,决策林与传统的随机森林有很大不同,因为它是随机,可区分的,并且能够使用后传播来更新和学习神经网络中的特征表示。 DCASE2019和ESC-50数据集的实验结果表明,我们提出的DNDF方法在分类准确性方面提高了ASC性能,并且与最先进的基线相比,表现出竞争性能。

Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment. In this regard, deep learning based approaches have emerged as a useful tool for ASC problems. Conventional approaches to improving the classification accuracy include integrating auxiliary methods such as attention mechanism, pre-trained models and ensemble multiple sub-networks. However, due to the complexity of audio clips captured from different environments, it is difficult to distinguish their categories without using any auxiliary methods for existing deep learning models using only a single classifier. In this paper, we propose a novel approach for ASC using deep neural decision forest (DNDF). DNDF combines a fixed number of convolutional layers and a decision forest as the final classifier. The decision forest consists of a fixed number of decision tree classifiers, which have been shown to offer better classification performance than a single classifier in some datasets. In particular, the decision forest differs substantially from traditional random forests as it is stochastic, differentiable, and capable of using the back-propagation to update and learn feature representations in neural network. Experimental results on the DCASE2019 and ESC-50 datasets demonstrate that our proposed DNDF method improves the ASC performance in terms of classification accuracy and shows competitive performance as compared with state-of-the-art baselines.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源