Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system

Anirban Dutta; Gudmalwar Ashishkumar; Ch. V. Rama Rao

首页> 外文期刊>International journal of speech technology >Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system

【24h】

Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system

机译：Gabor滤波器的光谱时域特征提取设计以提高ASR系统的性能

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Existing automatic speech recognition (ASR) system uses the spectral or temporal features of speech. The performance of such systems is still poor compared to the human perception of hearing, especially in noisy environments. This paper concentrates on the extraction of spectro-temporal features based on physiological and psychoacoustically inspired approaches. Here, two dimensional Gabor filters are used to estimate the spectro-temporal features from time-frequency representation of uttered speech signals. The Gabor filters are designed using the concept of constant Q factor. It is found that human perception system maintains approximately constant Q in its frequency response along the chain of its filter bank. Constant Q analysis ensures that the Gabor filters occupy a set of geometrically spaced spectral and temporal bins. Time-frequency representation of speech signal is a key ingredient for Gabor based feature extraction method. For time-frequency mapping, Gammatonegram is adopted instead of conventional spectrogram representations. The performance of the ASR system with the proposed feature set is experimentally validated using AURORA2 noisy digit database. Under clean training; the proposed features obtained a relative improvement of about 50% in word error rate (WER) compared to Mel frequency cep-stral coefficients (MFCC) features. A relative improvement of 23% in WER is also obtained compared with that of existing spectro-temporal feature extraction methods. Further analysis is carried out on TIMET corrupted with noise samples taken from the NOISEX-92 database. The experimental verification proves the robustness of proposed features in building a robust acoustic model for the ASR system.

机译：现有的自动语音识别（ASR）系统使用语音的频谱或时间特征。与人类对听力的感知相比，此类系统的性能仍然很差，尤其是在嘈杂的环境中。本文着重于基于生理和心理听觉启发方法的光谱时间特征的提取。在此，使用二维Gabor滤波器从发出的语音信号的时频表示中估计频谱时间特征。 Gabor滤波器是使用恒定Q因子的概念设计的。发现人类感知系统沿其滤波器组链的频率响应中保持近似恒定的Q。常数Q分析可确保Gabor滤波器占据一组在几何上隔开的频谱和时间区间。语音信号的时频表示是基于Gabor的特征提取方法的关键要素。对于时频映射，采用伽玛音标代替常规的频谱图表示。使用AURORA2噪声数字数据库通过实验验证了具有建议功能集的ASR系统的性能。在干净的训练下；与梅尔频率倒谱系数（MFCC）特征相比，拟议的特征在字错误率（WER）方面获得了约50％的相对改进。与现有的光谱时间特征提取方法相比，WER相对提高了23％。对TIMET进行了进一步分析，其中TIMET被NOISEX-92数据库中的噪声样本破坏了。实验验证证明了在为ASR系统建立鲁棒的声学模型时所提出功能的鲁棒性。

著录项

来源
《International journal of speech technology》 |2019年第4期|1085-1097|共13页
作者
Anirban Dutta; Gudmalwar Ashishkumar; Ch. V. Rama Rao;
展开▼
作者单位

Department of Electronics and Communication Engineering National Institute of Technology Shillong Meghalaya 793003 India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Spectro-temporal feature; Constant Q factor; Deep neural network; Gabor filter; Speech recognition;

机译：光谱时态特征;恒定的Q因子;深度神经网络Gabor滤波器;语音识别;

相似文献

外文文献
中文文献
专利

1. Gait Recognition Based on the Feature Extraction of Gabor Filter and Linear Discriminant Analysis and Improved Local Coupled Extreme Learning Machine [J] . Hongli Guo, Bin Li, Youmei Zhang, Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：基于Gabor滤波器的特征提取和线性判别分析和改进的局部耦合极限学习机的步态识别
2. Fast Gabor texture feature extraction with separable filters using GPU [J] . Pang Wai-Man, Choi Kup-Sze, Qin Jing Journal of Real-Time Image Processing . 2016,第1期

机译：使用GPU使用可分离的滤镜快速提取Gabor纹理特征
3. A Robust Iris Feature Extraction Approach Based on Monogenic and 2D Log-Gabor Filters [J] . Walid Aydi, Nade Fadhel, Nouri Masmoudi, Journal of Intelligent Systems . 2015,第2期

机译：基于单基因和二维Log-Gabor滤波器的鲁棒虹膜特征提取方法
4. On the Impact of Gabor Phase for Spectro-Temporal Feature Extraction in Building an ASR System [C] . Anirban Dutta, Gudmalwar Prabhakar, Ch V Rama Rao Annual IEEE Information Technology, Electronics and Mobile Communication Conference . 2020

机译：关于葛兰阶段对建立ASR系统光谱 - 时间特征提取的影响
5. Prefiltering for improved unknown and known source correlation detection of broadband oscillatory transients and predicting the onset of paroxysmal atrial fibrillation using feature extraction and a hamming neural network. [D] . Dean, Marcella Elsener. 2003

机译：进行预过滤，以改进宽带振荡瞬态的未知和已知源相关性检测，并使用特征提取和汉明神经网络预测阵发性房颤的发作。
6. Hybrid Discrete Wavelet Transform and Gabor Filter Banks Processing for Features Extraction from Biomedical Images [O] . Salim Lahmiri, Mounir Boukadoum 2013

机译：混合离散小波变换和Gabor滤波器组处理用于从生物医学图像中提取特征
7. Features Extraction for Pollen Recognition Using Gabor Filters [O] . Dimitar Nikolov Nikolov, Diana Dimitrova Tsankova 2018

机译：使用Gabor滤波器的花粉识别特征提取

Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system

摘要

著录项

相似文献

相关主题

期刊订阅