Temporal Structure Normalization of Speech Feature for Robust Speech Recognition

Xiao X.; Chng E. S.; Li H.

Abstract

This letter presents a new feature normalization technique that normalizes the temporal structure of speech features. The temporal structure of the features is partially represented by their power spectral density (PSD). We observed that the PSD of the features varies with the corrupting noise and the signal-to-noise ratio. To reduce the PSD variation due to noise, we propose to normalize the PSD of the features to a reference function by filtering the features. Experimental results on the AURORA-2 task show that the proposed approach, when combined with mean and variance normalization, significantly improves speech recognition accuracy; the system achieves a 69.11% relative error rate reduction over the baseline.
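
The abstract does not give the filter design, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes each feature dimension is treated as a 1-D trajectory over frames, that the PSD is estimated with an averaged periodogram, and that the filter is a linear-phase FIR whose magnitude response approximates the square root of the ratio between the reference PSD and the observed PSD. The function names, the FFT size, the tap count, and the order of the normalization steps are all hypothetical choices made for illustration.

```python
import numpy as np


def mvn(x):
    """Mean and variance normalization of one feature trajectory."""
    return (x - x.mean()) / (x.std() + 1e-12)


def trajectory_psd(x, n_fft=64):
    """Averaged periodogram (Hann window, 50% overlap) of a 1-D trajectory."""
    win = np.hanning(n_fft)
    hop = n_fft // 2
    if len(x) < n_fft:
        x = np.pad(x, (0, n_fft - len(x)))
    frames = [x[i:i + n_fft] * win for i in range(0, len(x) - n_fft + 1, hop)]
    spec = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return spec.mean(axis=0) / np.sum(win ** 2)


def design_filter(psd_in, psd_ref, n_taps=33, floor=1e-10):
    """FIR filter whose magnitude response approximates sqrt(psd_ref / psd_in)."""
    mag = np.sqrt(psd_ref / np.maximum(psd_in, floor))
    h = np.fft.irfft(mag)                  # zero-phase response, symmetric about index 0
    h = np.roll(h, n_taps // 2)[:n_taps]   # centre the response and truncate it
    return h * np.hanning(n_taps)          # taper to reduce truncation ripple


def normalize_temporal_structure(x, psd_ref, n_fft=64, n_taps=33):
    """Assumed pipeline: MVN, PSD-matching filter, then MVN again."""
    x = mvn(x)
    h = design_filter(trajectory_psd(x, n_fft), psd_ref, n_taps)
    y = np.convolve(x, h, mode="same")     # odd tap count, so no net delay
    return mvn(y)
```

In this sketch the reference PSD `psd_ref` would be estimated once, for example by averaging `trajectory_psd` over clean training trajectories of the same feature dimension, and then reused for every test utterance. Tapering the taps smooths the achieved magnitude response, so the output PSD only approximates the reference.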

Bibliographic record

Source: IEEE Signal Processing Letters, 2007, no. 7, pp. 500-503 (4 pages)
Authors: Xiao X.; Chng E. S.; Li H.
Original format: PDF
Language: English
CLC classification: Communication theory
Keywords: Feature normalization; robust speech recognition; temporal filter; temporal structure

