SVD-based redundancy removal in 1-D CNNs for acoustic scene classification

Singh Arshdeep; Rajan Padmanabhan; Bhavsar Arnav

首页> 外文期刊>Pattern recognition letters >SVD-based redundancy removal in 1-D CNNs for acoustic scene classification

【24h】

SVD-based redundancy removal in 1-D CNNs for acoustic scene classification

机译：基于SVD的冗余删除在1-D CNN中进行声学场景分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this letter, we propose a concise feature representation framework for acoustic scene classification by pruning embeddings obtained from SoundNet, a deep convolutional neural network. We demonstrate that the feature maps generated at various layers of SoundNet have redundancy. The proposed singular value decomposition based method reduces the redundancy while relying on the assumption that useful feature maps produced by different classes lie along different directions. This leads to ignoring the feature maps that produce similar embeddings for different classes. In the context of using an ensemble of classifiers on the various layers of SoundNet, pruning the redundant feature maps leads to reduction in dimensionality and computational complexity. Our experiments on acoustic scene classification demonstrate that ignoring 73% of feature maps reduces the performance by less than 1% with 12.67% reduction in computational complexity. In addition to this, we also show that the proposed pruning framework can be utilized to remove filters in the SoundNet network architecture, with 13x lesser model storage requirement. Also, the number of parameters reduce from 28 million to 2 million with marginal degradation in performance. This small model obtained after applying the proposed pruning procedure is evaluated on different acoustic scene classification datasets, and shows excellent generalization ability. (c) 2020 Elsevier B.V. All rights reserved.

机译：在这封信中，我们向声学场景分类提出了一种简洁的特征表示框架，通过从SoundNet获得的嵌入，深度卷积神经网络。我们展示了在各种SoundNet中生成的特征映射具有冗余。所提出的奇异值分解的方法减少了冗余，同时依赖于假设不同类别的有用特征映射沿不同的方向。这导致忽略为不同类别产生类似嵌入的特征映射。在使用各种颜料层上的分类器的集分类的上下文中，修剪冗余特征映射导致减少维度和计算复杂度。我们对声学场景分类的实验表明，忽略了73％的特征贴图将性能降低了小于1％，计算复杂度降低了12.67％。除此之外，我们还表明，建议的修剪框架可用于在SoundNet网络架构中移除过滤器，具有13x较小的模型存储要求。此外，参数的数量从2800万到200万增加到200万到200万，性能边际降级。在应用所提出的修剪程序后获得的这种小型模型在不同的声学场景分类数据集上进行评估，并显示出优异的泛化能力。（c）2020 Elsevier B.v.保留所有权利。

著录项

来源
《Pattern recognition letters》 |2020年第3期|383-389|共7页
作者
Singh Arshdeep; Rajan Padmanabhan; Bhavsar Arnav;
展开▼
作者单位

Indian Inst Technol IIT Mandi Mandi 175005 Himachal Prades India;

Indian Inst Technol IIT Mandi Mandi 175005 Himachal Prades India;

Indian Inst Technol IIT Mandi Mandi 175005 Himachal Prades India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Pruning; SoundNet; Embedding; Response matrix; Acoustic scene classification;

机译：修剪;Soundnet;嵌入;响应矩阵;声学场景分类;

相似文献

外文文献
中文文献
专利

1. Acoustic scene classification using deep CNN with fine-resolution feature [J] . Zhang Tao, Liang Jinhua, Ding Biyun Expert Systems with Application . 2020,第Apra期

机译：使用具有高分辨率的深CNN进行声场分类
2. PSO optimized 1-D CNN-SVM architecture for real-time detection and classification applications [J] . Navaneeth Bhaskar, Suchetha M. Computers in Biology and Medicine . 2019,第期

机译：PSO优化了1-D CNN-SVM架构，用于实时检测和分类应用
3. Adaptive weights learning in CNN feature fusion for crime scene investigation image classification [J] . Ying Liu, Qian Nan Zhang, Fu Ping Wang, Connection Science . 2021,第3期

机译：CNN的自适应权重学习犯罪现场调查图像分类的特征融合
4. SVD-Based Channel Pruning for Convolutional Neural Network in Acoustic Scene Classification Model [C] . Jun Wang, Shengchen Li, Wenwu Wang IEEE International Conference on Multimedia Expo Workshops . 2019

机译：场景分类模型中基于SVD的卷积神经网络通道修剪
5. Augmented Dual Input CNN (DI-CNN) for the Diagnostic Classification of Lung Nodule Malignancy from CT Scans [D] . Jain, Arshita. 2020

机译：增强双输入CNN（DI-CNN），用于CT扫描的肺结结恶性肿瘤诊断分类
6. Sensor Fault Detection and Diagnosis Method for AHU Using 1-D CNN and Clustering Analysis [O] . Jingjing Liu, Min Zhang, Hai Wang, 2019

机译：一维CNN和聚类分析的AHU传感器故障检测与诊断方法
7. CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification [O] . Zhao Ren, Qiuqiang Kong, Jing Han, 2020

机译：CAA-NET：有条件的CNN，注意可解释的可解释装置 - 强大的声学场景分类

SVD-based redundancy removal in 1-D CNNs for acoustic scene classification

摘要

著录项

相似文献

相关主题

期刊订阅