首页> 外文会议>European Signal Processing Conference >A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION

【24h】

A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION

机译：鲁棒语音识别的听觉特征比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a detailed comparison of the performance of two auditory based feature extraction algorithms for automatic speech recognition (ASR). The feature sets are Zero- Crossings with Peak Amplitudes (ZCPA) and the recently introduced Power-Law Nonlinearity and Power-Bias Subtraction (PNCC). Standard Mel-Frequency Cepstral Coefficients (MFCC) are also tested for comparison. Although front-ends have been compared in previous papers, this work focuses on two of the most promising algorithms for noise robustness. The performance of all features is reported on the TIMIT database using a HMM system. It is found that the PNCC features outperform MFCC in clean conditions and are robust to noise. ZCPA performance is shown to vary widely with filterbank configuration and frame length. The ZCPA performance is poor in clean conditions but is the least affected by white noise. PNCC is shown to be the most promising new feature set for robust ASR in recent years.

机译：本文介绍了两个基于听觉的特征提取算法的性能的详细比较，用于自动语音识别（ASR）。特征集是具有峰值幅度（ZCPA）的零点（ZCPA），最近引入的电力 - 法律非线性和功率偏压减法（PNCC）。还测试了标准熔融频率谱系齐数（MFCC）以进行比较。虽然前端已经在先前的论文中进行了比较，但这项工作侧重于噪声稳健性最有前景的两个算法。使用HMM系统在Timit数据库上报告所有功能的性能。发现PNCC在清洁条件下优于MFCC，并且对噪声稳健。 ZCPA性能显示出滤波器配置和帧长度的广泛变化。 ZCPA性能在清洁条件下差，但受白噪声的影响最小。 PNCC显示为近年来为强大的ASR提供最有希望的新功能。

著录项

来源
《European Signal Processing Conference 》|2010年||共5页
会议地点
作者
Finnian Kelly; Naomi Harte;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN911.7-53;
关键词

相似文献

外文文献
中文文献
专利

1. Combining speech enhancement and auditory feature extraction for robust speech recognition [J] . Michael Kleinschmidt, Jurgen Tchorz, Birger Kollmeier Speech Communication . 2001 ,第1a2期

机译：结合语音增强和听觉特征提取以实现强大的语音识别
2. On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise [J] . Pardede Hilman F., Yuliani Asri R., Subekti Agus Journal of Information Recording . 2019 ,第1期

机译：实施人类听觉系统对基于Q-Log的语音识别抗噪声鲁棒性功能的影响
3. On the relevance of auditory-based Gabor features for deep learning in robust speech recognition [J] . Angel Mario Castro Martinez, Sri Harish Mallidi, Bernd T. Meyer Computer speech and language . 2017 ,第Sepa期

机译：基于听觉的Gabor特征与鲁棒语音识别中深度学习的相关性
4. A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION [C] . Finnian Kelly, Naomi Harte European Signal Processing Conference . 2010

机译：鲁棒语音识别的听觉特征比较
5. Robust Recognition of Binaural Speech Signals Using Techniques Based on Human Auditory Processing [D] . Menon, Anjali I. 2019

机译：基于人类听觉处理技术的双耳语音信号的稳健识别
6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O] . Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor 2013

机译：使用滤波自相关序列的鲁棒MVDR频谱进行鲁棒语音识别的新功能
7. A Comparison of Auditory Features for Robust Speech Recognition [O] . Kelly Finnian 2010

机译：健壮语音识别的听觉功能比较

A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION

摘要

著录项

相似文献

相关主题

期刊订阅