Deep Feature Embedding and Hierarchical Classification for Audio Scene Classification

机译：音频场景分类的深度特征嵌入和层次分类

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this work, we propose an approach that features deep feature embedding learning and hierarchical classification with triplet loss function for Acoustic Scene Classification (ASC). In the one hand, a deep convolutional neural network is firstly trained to learn a feature embedding from scene audio signals. Via the trained convolutional neural network, the learned embedding embeds an input into the embedding feature space and transforms it into a high-level feature vector for representation. In the other hand, in order to exploit the structure of the scene categories, the original scene classification problem is structured into a hierarchy where similar categories are grouped into meta-categories. Then, hierarchical classification is accomplished using deep neural network classifiers associated with triplet loss function. Our experiments show that the proposed system achieves good performance on both the DCASE 2018 Task 1A and 1B datasets, resulting in accuracy gains of 15.6% and 16.6% absolute over the DCASE 2018 baseline on Task 1A and 1B, respectively.

机译：在这项工作中，我们提出了一种以深度场景嵌入学习和具有三重损失函数的层次分类为特征的声学场景分类（ASC）方法。一方面，首先训练深度卷积神经网络，以从场景音频信号中学习嵌入的特征。通过训练的卷积神经网络，学习的嵌入将输入嵌入到嵌入特征空间中，并将其转换为高级特征向量以进行表示。另一方面，为了利用场景类别的结构，将原始场景分类问题构造为层次结构，其中将相似的类别分组为元类别。然后，使用与三元组损失函数关联的深度神经网络分类器完成分层分类。我们的实验表明，所提出的系统在DCASE 2018任务1A和1B数据集上均实现了良好的性能，相对于任务1A和1B的DCASE 2018基线，绝对精度分别提高了15.6％和16.6％。

著录项

来源
《International Joint Conference on Neural Networks》|2020年|1-7|共7页
会议地点
作者
Lam Pham; Ian McLoughlin; Huy Phan; R. Palaniappan; Alfred Mertins;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Spectrogram; Task analysis; Time-frequency analysis; Acoustics; Transforms; Data structures;

机译：特征提取频谱图任务分析时频分析声学变换数据结构;

相似文献

外文文献
中文文献
专利

1. Classification of audio scenes with novel features in a fused system framework [J] . Waldekar Shefali, Saha Goutam Digital Signal Processing . 2018,第期

机译：融合系统框架中具有新功能的音频场景分类
2. Hierarchical feature coding model for high-resolution satellite scene classification [J] . Zhao Fengan, Mu Xiaodong, Yang Zhou, Journal of Applied Remote Sensing . 2019,第1期

机译：高分辨率卫星场景分类的分层特征编码模型
3. Aggregating Rich Hierarchical Features for Scene Classification in Remote Sensing Imagery [J] . Guoli Wang, Bin Fan, Shiming Xiang, Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2017,第9期

机译：聚合丰富的层次结构特征以进行遥感影像中的场景分类
4. Acoustic Scene Classification Using Deep Audio Feature and BLSTM Network [C] . Yanxiong Li, Xianku Li, Yuhan Zhang, International Conference on Audio, Language and Image Processing . 2018

机译：使用深度音频功能和BLSTM网络进行声场分类
5. Deep Features based Hierarchical Classification Scheme for Face Recognition in Heterogeneous Environments [D] . Narang, Neeru. 2017

机译：基于深度特征的异构环境中面部识别的分层分类方案
6. Deep Learning for Feature Extraction in Remote Sensing: A Case-Study of Aerial Scene Classification [O] . Biserka Petrovska, Eftim Zdravevski, Petre Lameski, 2020

机译：遥感中特征提取的深度学习 - 以空域分类为例
7. Learning Hierarchy Aware Embedding from Raw Audio for Acoustic Scene Classification [O] . Vinayak Abrol, Pulkit Sharma 2020

机译：学习层次结构意识到从原始音频嵌入声学场景分类

Deep Feature Embedding and Hierarchical Classification for Audio Scene Classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅