Robust speech recognition based on emphasis filtering on formant regions in mobile noise environment

机译：基于对移动噪声环境中共振峰区域进行重点过滤的鲁棒语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In real environment, the performance of automatic speech recognizer (ASR) has been degraded by various noises. Thus, some speech enhancement techniques have been applied but some distortions have still existed in the speech region. Actually, MFCC-derived features which have been popularly used in most of ASRs are strongly related with the spectral envelope. Moreover, the formant in the spectral envelope is very important factor to recognize the speech. However, it is very difficult to detect exactly the formant frequency in various noisy environments. Therefore, we propose the efficient algorithm to detect the spectral regions presenting any formants based on the spectral ratio instead of the direct detection of formant frequency. Then we check whether each detected frequency range is valid by several confidence measures. If it is acceptable, the region is amplified by the proper emphasis filter. The evaluation in the mobile voice search task shows that the proposed method provides some improvement over baseline system.

机译：在实际环境中，各种噪音会降低自动语音识别器（ASR）的性能。因此，已经应用了一些语音增强技术，但是在语音区域中仍然存在一些失真。实际上，大多数ASR中普遍使用的MFCC衍生功能与频谱包络密切相关。此外，频谱包络中的共振峰是识别语音的非常重要的因素。但是，在各种嘈杂环境中准确检测共振峰频率非常困难。因此，我们提出一种有效的算法，基于频谱比来检测呈现任何共振峰的频谱区域，而不是直接检测共振峰频率。然后，我们通过几种置信度来检查每个检测到的频率范围是否有效。如果可以接受，则通过适当的加重滤镜放大该区域。在移动语音搜索任务中的评估表明，所提出的方法相对于基线系统提供了一些改进。

著录项

来源
《Korean Society of Noise & Vibration Control;Institute of Noise Control Engineering;International congress and exposition on noise control engineering;ASME Noise Control & Acoustics Division》|2015年|1-7|共7页
会议地点 San Francisco, CA(US)
作者
Hwa Jeon Song; Kiyoung Park; Yunkeun Lee;
展开▼
作者单位

Spoken Language Processing Team Electronics and Telecommunications Research Institute Daejeon 305-700 Korea email: hwajeon@etri.re.kr;

Spoken Language Processing Team Electronics and Telecommunications Research Institute Daejeon 305-700 Korea email: pkyoung@ etri.re.kr;

Spoken Language Processing Team Electronics and Telecommunications Research Institute Daejeon 305-700 Korea email: yklee@ etri.re.kr;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Dual channel based speech enhancement using novelty filter for robust speech recognition in automobile environment [J] . Jounghoon Beh, Baran R.H., Hanseok Ko IEEE Transactions on Consumer Electronics . 2006,第2期

机译：使用新颖滤波器的基于双通道的语音增强，可在汽车环境中实现可靠的语音识别
2. Robust Arabic speech recognition in noisy environments using prosodic features and formant [J] . A.I. Amrous, M. Debyeche, A. Amrouche International journal of speech technology . 2011,第4期

机译：使用韵律特征和共振峰在嘈杂的环境中进行强大的阿拉伯语语音识别
3. Combination of GMM-Based Speech Estimation Method and Temporal Domain SVD-Based Speech Enhancement for Noise Robust Speech Recognition [J] . Masakiyo Fujimoto, Yasuo Ariki Systems and Computers in Japan . 2007,第3期

机译：基于GMM的语音估计方法与基于时域SVD的语音增强相结合的噪声鲁棒语音识别
4. Robust speech recognition based on emphasis filtering on formant regions in mobile noise environment [C] . Hwa Jeon Song, Kiyoung Park, Yunkeun Lee International Congress and Exposition on Noise Control Engineering . 2013

机译：基于强调滤波在流动噪声环境中的格式区域的强大语音识别
5. THE SPEECH SPECTRUM AND ITS RELATIONSHIP TO INTELLIGIBILITY OF SPEECH (ARTICULATION INDEX, FORMANT REGIONS, FRENCH AND STEINBERG). [D] . ENGLERT, SUE ELLEN. 1986

机译：语音谱及其与语音智能性的关系（发音指数，共振峰区域，法语和斯坦伯格）。
6. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：在带通滤波调制多流功能根据框架鲁棒语音识别
7. Dual Channel Based Speech Enhancement Using Novelty Filter for Robust Speech Recognition in Automobile Environment [O] . Jounghoon Beh, Student Member, Robert H. Baran, 2006

机译：基于双通道语音增强的新型滤波器在汽车环境中的鲁棒语音识别

Robust speech recognition based on emphasis filtering on formant regions in mobile noise environment

摘要

著录项

相似文献

相关主题

期刊订阅