Robust speech recognition by selecting mel-filter banks

Abstract

Mel-filterbank energies are a key feature widely employed in automatic speech recognition (ASR) systems. They are typically derived from a sub-band spectrum. When background noise is present, however, Mel-filterbank energies are difficult to estimate accurately. This paper analyzes theoretically how noise influences the trajectories of not only the "traditional" log Mel-filterbank energies but also their delta parameters. As a result, neither the log Mel-filterbank energies nor their delta parameters can be calculated correctly. We propose removing the severely contaminated Mel-filterbank features and keeping only those that perform better on the remaining speech. We demonstrate the effectiveness of this novel operation through speech recognition experiments conducted on the Aurora-2 database.
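
The feature pipeline the abstract refers to can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the filter count, the delta window, or the criterion for deciding which filter banks are "severely contaminated". The function names, the common 2595·log10(1 + f/700) mel formula, the regression-style delta, and the hand-picked `keep` indices are therefore all assumptions made here for illustration.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (assumed; the abstract does not specify one).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters over FFT bins 0..n_fft//2, equally spaced in mel."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge points, converted to fractional FFT-bin positions.
    edges = [mel_to_hz(mel_max * i / (n_filters + 1)) * n_fft / sample_rate
             for i in range(n_filters + 2)]
    bank = []
    for j in range(n_filters):
        lo, mid, hi = edges[j], edges[j + 1], edges[j + 2]
        row = []
        for k in range(n_bins):
            if lo < k <= mid:
                row.append((k - lo) / (mid - lo))   # rising slope
            elif mid < k < hi:
                row.append((hi - k) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def log_mel_energies(power_frames, bank, floor=1e-10):
    """Log of the filter-weighted power sum, per frame and per filter."""
    return [[math.log(max(sum(w * p for w, p in zip(row, frame)), floor))
             for row in bank]
            for frame in power_frames]

def deltas(features, width=2):
    """Regression deltas over +/- width frames, clamping at the sequence edges."""
    n_frames = len(features)
    denom = 2.0 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(n_frames):
        row = []
        for j in range(len(features[0])):
            num = sum(n * (features[min(t + n, n_frames - 1)][j]
                           - features[max(t - n, 0)][j])
                      for n in range(1, width + 1))
            row.append(num / denom)
        out.append(row)
    return out

def select_channels(features, keep):
    """Keep only the filter-bank channels listed in `keep` (hypothetical choice)."""
    return [[frame[j] for j in keep] for frame in features]

# Toy usage: 5 frames of flat power spectrum, 8 kHz sampling assumed.
bank = mel_filterbank(n_filters=8, n_fft=64, sample_rate=8000)
frames = [[1.0] * 33 for _ in range(5)]
static = log_mel_energies(frames, bank)
keep = [2, 3, 4, 5]  # placeholder indices, not taken from the paper
selected = select_channels(static, keep)
selected_deltas = select_channels(deltas(static), keep)
```

Here the selection is applied to the static features and their deltas with the same channel list; whether the paper selects them jointly or separately is not stated in the abstract.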

Bibliographic details

Source
International Conference on Electronics, Electrical Engineering and Information Science | 2017 | p. 523 | 10 pages total
Authors
Yun-Peng Wu; Jia-Min Mao; Wei-Feng Li
Original format
PDF
Chinese Library Classification
TN-53
Keywords
Speech recognition; Mel-filterbank (MFB); Mel-filterbank energies; Mel-Frequency Cepstral Coefficients (MFCCs)
Date indexed
2022-08-21 11:43:12

Similar literature

1. A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems [J]. Vlaj D, Kotnik B, Horvat B. EURASIP journal on applied signal processing. 2005, no. 4

2. A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems [J]. Damjan Vlaj, Bojan Kotnik, Bogomir Horvat. EURASIP journal on advances in signal processing. 2005, no. 4

3. Enhancing robustness for speech recognition through bio-inspired auditory filter-bank [J]. Hari Krishna Maganti, Marco Matassoni. International Journal of Bio-Inspired Computation. 2012, no. 5

4. Robust speech recognition by selecting mel-filter banks [C]. Yun-Peng Wu, Jia-Min Mao, Wei-Feng Li. International Conference on Electronics, Electrical Engineering and Information Science. 2017

5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005

6. New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition [O]. Sanaz Seyedin, Seyed Mohammad Ahadi, Saeed Gazor. 2013

7. Robust speech recognition by selecting mel-filter banks [O]. Yun-Peng Wu, Jia-Min Mao, Wei-Feng Li. 2017

8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment [R]. Hansen, J. H. 2015
