Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

Ganapathy S.; Mallidi S.H.; Hermansky H.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

【24h】

Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

机译：使用自回归模型的调制滤波进行稳健的特征提取

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Speaker and language recognition in noisy and degraded channel conditions continue to be a challenging problem mainly due to the mismatch between clean training and noisy test conditions. In the presence of noise, the most reliable portions of the signal are the high energy regions which can be used for robust feature extraction. In this paper, we propose a front end processing scheme based on autoregressive (AR) models that represent the high energy regions with good accuracy followed by a modulation filtering process. The AR model of the spectrogram is derived using two separable time and frequency AR transforms. The first AR model (temporal AR model) of the sub-band Hilbert envelopes is derived using frequency domain linear prediction (FDLP). This is followed by a spectral AR model applied on the FDLP envelopes. The output 2-D AR model represents a low-pass modulation filtered spectrogram of the speech signal. The band-pass modulation filtered spectrograms can further be derived by dividing two AR models with different model orders (cut-off frequencies). The modulation filtered spectrograms are converted to cepstral coefficients and are used for a speaker recognition task in noisy and reverberant conditions. Various speaker recognition experiments are performed with clean and noisy versions of the NIST-2010 speaker recognition evaluation (SRE) database using the state-of-the-art speaker recognition system. In these experiments, the proposed front-end analysis provides substantial improvements (relative improvements of up to 25%) compared to baseline techniques. Furthermore, we also illustrate the generalizability of the proposed methods using language identification (LID) experiments on highly degraded high-frequency (HF) radio channels and speech recognition experiments on noisy data.

机译：主要由于干净的训练和嘈杂的测试条件之间的不匹配，在嘈杂和恶化的信道条件下的说话人和语言识别仍然是一个具有挑战性的问题。在存在噪声的情况下，信号中最可靠的部分是可用于鲁棒特征提取的高能量区域。在本文中，我们提出了一种基于自回归（AR）模型的前端处理方案，该模型以良好的精度表示高能量区域，然后进行调制滤波处理。使用两个可分离的时间和频率AR变换得出频谱图的AR模型。使用频域线性预测（FDLP）导出子带希尔伯特包络的第一个AR模型（时间AR模型）。然后是在FDLP包络上应用的光谱AR模型。输出的2-AR模型代表语音信号的低通调制滤波频谱图。通过将两个AR模型划分为不同的模型阶数（截止频率），可以进一步得出带通调制滤波后的频谱图。调制滤波后的频谱图将转换为倒谱系数，并用于嘈杂和混响条件下的说话人识别任务。使用最新的说话人识别系统，使用NIST-2010说话人识别评估（SRE）数据库的干净且嘈杂的版本执行各种说话人识别实验。在这些实验中，与基线技术相比，提出的前端分析提供了实质性的改进（相对改进高达25％）。此外，我们还说明了在高度退化的高频（HF）无线电信道上使用语言识别（LID）实验并在嘈杂数据上进行语音识别实验的方法的一般性。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2014年第8期|1285-1295|共11页
作者
Ganapathy S.; Mallidi S.H.; Hermansky H.;
展开▼
作者单位

IBM T.J. Watson Research Center, Yorktown Heights, USA|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Autoregressive modeling; feature extraction; modulation filtering; speaker and language recognition;

机译：自回归建模;特征提取;调制滤波;说话人和语言识别;

相似文献

外文文献
中文文献
专利

1. Parameter Estimation of Autoregressive Models Using the Iteratively Robust Filtered Fast-τ Method [J] . NIMA SHARIATI, HAMID SHAHRIARI, RASOUL SHAFAEI Communications in Statistics . 2014,第19a21期

机译：使用迭代强制滤波快速τ方法进行自回归模型的参数估计
2. Time-Varying Autoregressive Model-Based Multiple Modes Particle Filtering Algorithm for Respiratory Rate Extraction From Pulse Oximeter [J] . Biomedical Engineering, IEEE Transactions on . 2011,第3期

机译：基于时变自回归模型的多模式粒子滤波算法从脉搏血氧仪中提取呼吸频率
3. An Autoregressive Model-Based Particle Filtering Algorithms for Extraction of Respiratory Rates as High as 90 Breaths Per Minute From Pulse Oximeter [J] . Lee J., Chon K. H. Biomedical Engineering, IEEE Transactions on . 2010,第9期

机译：基于自回归模型的颗粒过滤算法，可从脉搏血氧仪中提取每分钟高达90次呼吸的呼吸频率
4. Markov Chain Monte Carlo Methods for Noise Robust Feature Extraction Using the Autoregressive Model [C] . Robert W. Morris, Jon A. Arrowood, Mark A. Clements, European Conference on Speech Communication and Technology . 2003

机译：Markov Chain Monte Carlo Carlo用于使用自回归模型提取的噪声强制特征
5. Feature extraction and reconstruction of two dimensional patterns using autoregressive moving average models and Fourier descriptors [D] . Leung, Siu Yun 1989

机译：使用自回归移动平均模型和傅立叶描述符对二维模式进行特征提取和重构
6. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：在带通滤波调制多流功能根据框架鲁棒语音识别
7. Robust Feature Extraction Using Modulation Filtering of Autoregressive Models [O] . Sriram Ganapathy, Sri Harish Mallidi, Student Member, 2014

机译：基于自回归模型调制滤波的鲁棒特征提取
8. Robust Feature Extraction from ECG Signals Based on Nonlinear Dynamical Modeling. [R] . Owis, M. I., Abou-Zied, A. H., Youssef, A. M., 2001

机译：基于非线性动力学建模的心电信号鲁棒特征提取。

Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅