CNN-LTE: A class of 1-X pooling convolutional neural networks on label tree embeddings for audio scene classification

机译：CNN-LTE：在标签树嵌入中用于音频场景分类的一类1-X池卷积神经网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present in this work an approach for audio scene classification. Firstly, given the label set of the scenes, a label tree is automatically constructed where the labels are grouped into meta-classes. This category taxonomy is then used in the feature extraction step in which an audio scene instance is transformed into a label tree embedding image. Elements of the image indicate the likelihoods that the scene instances belong to different meta-classes. A class of simple 1-X (i.e. 1-max, 1-mean, and 1-mix) pooling convolutional neural networks, which are tailored for the task at hand, are finally learned on top of the image features for scene recognition. Experimental results on the DCASE 2013 and DCASE 2016 datasets demonstrate the efficiency of the proposed method.

机译：我们在这项工作中提出了一种音频场景分类的方法。首先，给定场景的标签集，将自动构建标签树，其中将标签分组为元类。然后，在类别提取步骤中使用此类别分类法，在该步骤中，音频场景实例将转换为标签树嵌入图像。图像的元素指示场景实例属于不同的元类的可能性。最终在图像特征之上学习场景识别的一类简单的1-X（即1-max，1-mean和1-mix）池卷积神经网络。在DCASE 2013和DCASE 2016数据集上的实验结果证明了该方法的有效性。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2017年|136-140|共5页
会议地点
作者
Huy Phan; Philipp Koch; Lars Hertel; Marco Maass; Radoslaw Mazur; Alfred Mertins;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Long Term Evolution; Convolution; Noise measurement; Cepstral analysis; Feature extraction; Neural networks; Vegetation;

机译：长期演化卷积噪声测量倒频谱分析特征提取神经网络植被;

相似文献

外文文献
中文文献
专利

1. A novel acoustic scene classification model using the late fusion of convolutional neural networks and different ensemble classifiers [J] . Alamir Mahmoud A. Applied Acoustics . 2021,第Apra期

机译：一种新的声学场景分类模型，使用卷积神经网络的后期融合和不同的集合分类
2. Aerial scene classification via an ensemble extreme learning machine classifier based on discriminative hybrid convolutional neural networks features [J] . Ye Lihua, Wang Lei, Sun Yaxin, International journal of remote sensing . 2019,第7a8期

机译：基于判别式混合卷积神经网络的集成极端学习机分类器进行空中场景分类
3. Aerial scene classification via an ensemble extreme learning machine classifier based on discriminative hybrid convolutional neural networks features [J] . Ye Lihua, Wang Lei, Sun Yaxin, International journal of remote sensing . 2019,第7a8期

机译：基于鉴别的混合卷积神经网络特征的集合极端学习机分类器的空中场景分类
4. CNN-LTE: A class of 1-X pooling convolutional neural networks on label tree embeddings for audio scene classification [C] . Huy Phan, Philipp Koch, Lars Hertel, IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：CNN-LTE：在标签树嵌入上的一类1-X池卷积神经网络，用于音频场景分类
5. A Scalable and Low Power Deep Convolutional Neural Network for Multimodal Data Classification in Embedded Real-Time Systems [D] . Jafari, Ali. 2017

机译：用于嵌入式实时系统中的多模式数据分类的可扩展和低功耗的深卷积神经网络
6. Training Convolutional Neural Networks with Multi-Size Images and Triplet Loss for Remote Sensing Scene Classification [O] . Jianming Zhang, Chaoquan Lu, Jin Wang, 2020

机译：利用多尺寸图像和三重态损失训练卷积神经网络进行遥感场景分类
7. Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes [O] . Kumar, Anurag, Khadkevich, Maksim, Fugen, Christian 2017

机译：利用卷积神经网络从弱标记音频中获取知识声音事件和场景网络

CNN-LTE: A class of 1-X pooling convolutional neural networks on label tree embeddings for audio scene classification

摘要

著录项

相似文献

相关主题

期刊订阅