Digital Signal Processing (journal)

A new variance-based approach for discriminative feature extraction in machine hearing classification using spectrogram features


Abstract

Machine hearing is an emerging research field, analogous to machine vision, that aims to equip computers with the ability to hear and recognise a variety of sounds. It is a key enabler of natural human-computer speech interfacing, as well as of applications such as automated security surveillance, environmental monitoring, and smart homes/buildings/cities. Recent advances in machine learning allow current systems to accurately recognise a diverse range of sounds under controlled conditions. However, doing so in real-world noisy conditions remains a challenging task. Several front-end feature extraction methods have been used for machine hearing, employing speech recognition features like MFCC and PLP, as well as image-like features such as AIM and SIF. The best choice of feature is found to depend on the noise environment and the machine learning techniques used. Machine learning methods such as deep neural networks have been shown capable of inferring discriminative classification rules from less structured front-end features in related domains. In the machine hearing field, spectrogram image features have recently shown good performance for noise-corrupted classification using deep neural networks. However, there are many methods of extracting features from spectrograms. This paper explores a novel data-driven feature extraction method that uses variance-based criteria to define spectral pooling of features from spectrograms. The proposed method, based on maximising the pooled spectral variance of foreground and background sound models, is shown to achieve very good performance for robust classification. (C) 2016 Elsevier Inc. All rights reserved.
