Robust acoustic event recognition using AVMD-PWVD time-frequency image

Zhang Yanhua; Zhang Ke; Wang Jingyu; Su Yu

首页> 外文期刊>Applied Acoustics >Robust acoustic event recognition using AVMD-PWVD time-frequency image

【24h】

Robust acoustic event recognition using AVMD-PWVD time-frequency image

机译：使用AVMD-PWVD时频图像强大的声学事件识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Environmental sound feature extraction and classification are important signal analysis tools in many applications, such as audio surveillance, multimedia retrieval, and auditory source identification. However, the non-stationarity and discontinuity of environmental signals make quantification and classification a formidable challenge. Hence, researchers proposed to use the time-frequency image representation to quantify these non-stationarity, resulting in higher classification accuracy. In this paper, a time-frequency representation method is proposed to represent environmental sound signals. Our approach consists of three stages: Firstly, we propose an adaptive variational modal decomposition (AVMD) based on central angular frequency difference to decompose environmental sounds into a series of modes. Secondly, we use the pseudo Wigner-Vile distribution (PWVD) to accurately obtain the instantaneous frequency characteristics of mode signals. Thirdly, time-frequency images of sound signals are obtained by combining the mode signals with PWVD. Finally, we put the time-frequency image into a convolutional neural network (CNN) for classification. The method is tested on the Real World Computing Partnership (RWCP) Sound Scene Database of 50 classes in mismatched conditions. Results show that our method is robust to noise and achieves the best average recognition accuracy compared with several state-of-art methods under clean and various noisy conditions. (C) 2021 Elsevier Ltd. All rights reserved.

机译：环境声音特征提取和分类是许多应用中的重要信号分析工具，例如音频监控，多媒体检索和听觉源识别。然而，环境信号的非公平性和不连续性使得量化和分类成为一个强大的挑战。因此，提出的研究人员使用时频图像表示来量化这些非平稳性，从而导致更高的分类精度。在本文中，提出了一种时频表示方法来表示环境声音信号。我们的方法由三个阶段组成：首先，我们提出了一种基于中央角频率差的自适应变分模式分解（AVMD），以将环境声音分解为一系列模式。其次，我们使用伪Wigner-vile分布（PWVD）精确地获得模式信号的瞬时频率特性。第三，通过将模式信号与PWVD组合来获得声音信号的时频图像。最后，我们将时频图像放入卷积神经网络（CNN）中进行分类。该方法在More World Computing Partnership（RWCP）声音场景数据库上进行了测试，其中50个类别中的不匹配条件。结果表明，我们的方法对噪声稳健，与清洁和各种嘈杂条件下的几种最先进的方法相比，实现了最佳的平均识别精度。（c）2021 elestvier有限公司保留所有权利。

著录项

来源
《Applied Acoustics》 |2021年第7期|107970.1-107970.10|共10页
作者
Zhang Yanhua; Zhang Ke; Wang Jingyu; Su Yu;
展开▼
作者单位

Northwestern Polytech Univ Natl Key Lab Aerosp Flight Dynam 127 Youyi Xilu Xian 710072 Shanxi Peoples R China;

Northwestern Polytech Univ Natl Key Lab Aerosp Flight Dynam 127 Youyi Xilu Xian 710072 Shanxi Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Time-frequency image; Acoustic event recognition; Pseudo Wigner-Vile distribution; Variational modal decomposition; Pseudo-color; Convolutional neural network;

机译：时频图像;声学事件识别;伪Wigner-vile分布;变分模态分解;伪颜色;卷积神经网络;

相似文献

外文文献
中文文献
专利

1. Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition [J] . Sharan Roneel V., Moir Tom J. Applied Acoustics . 2018,第NOVa期

机译：伪彩色耳蜗图像特征和顺序特征选择，可实现可靠的声音事件识别
2. Acoustic event filterbank for enabling robust event recognition by cleaning robot [J] . Park Sangwook, Choi Woohyun, Han David K., Consumer Electronics, IEEE Transactions on . 2015,第2期

机译：声事件过滤器库，可通过清洁机器人实现强大的事件识别
3. Ambient acoustic event assistive framework for identification, detection,and recognition of unknown acoustic events of a residence [J] . Sharnil Pandya, Hemant Ghayvat Advanced engineering informatics . 2021,第Jana期

机译：环境声学事件辅助框架，用于鉴定，检测和识别住所的未知声学事件
4. Time-Frequency Image Resizing Using Interpolation for Acoustic Event Recognition with Convolutional Neural Networks [C] . Roneel V. Sharan, Tom J. Moir 2019 IEEE International Conference on Signals and Systems . 2019

机译：卷积神经网络的插值时频图像尺寸调整用于声事件识别
5. Time-frequency acoustic processing and recognition: Analysis and analog VLSI implementations. [D] . Edwards, Robert Timothy. 1999

机译：时频声学处理和识别：分析和模拟VLSI实现。
6. Characterization and Robust Classification of EEG Signal from Image RSVP Events with Independent Time-Frequency Features [O] . Jia Meng, Lenis Mauricio Meriño, Nima Bigdely Shamlo, -1

机译：表征和图像RsVp活动与独立时频特性脑电信号的鲁棒分类
7. Characterization and robust classification of EEG signal from image RSVP events with independent time-frequency features. [O] . Jia Meng, Lenis Mauricio Meriño, Nima Bigdely Shamlo, 2012

机译：具有独立时频特征的图像RsVp事件的EEG信号的表征和鲁棒分类。

Robust acoustic event recognition using AVMD-PWVD time-frequency image

摘要

著录项

相似文献

相关主题

期刊订阅