首页> 外文会议>IEEE Region 8 EUROCON >Usage of Frame Dropping and Frame Attenuation Algorithms in Automatic Speech Recognition Systems

【24h】

Usage of Frame Dropping and Frame Attenuation Algorithms in Automatic Speech Recognition Systems

机译：自动语音识别系统中帧掉落和帧衰减算法的用途

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper the usage of frame dropping and frame attenuation algorithms in automatic speech recognition systems is presented. On the one hand, the use of frame dropping algorithms is important because the speech recognition system does not need to deal with noise-only parts of input signal, but on the other hand, the speech recognition results can be better if the spectral magnitudes of noise-only frames are attenuated. A novel approach of voice activity detection (VAD) based on the log filter-bank magnitudes needed for the frame dropping or the frame attenuation with the so-called "hangover" criterion is proposed. All tests were made on Slovenian, German and Spanish fixed telephone SpeechDat II databases with the HTK speech recognition toolkit. The results obtained show that small word error rate can be achieved at small number of Gaussian mixtures if either frame dropping or frame attenuation algorithm is used.

机译：本文提出了自动语音识别系统中帧滴加和帧衰减算法的使用。一方面，使用帧掉落算法是重要的，因为语音识别系统不需要处理输入信号的噪声部分，但另一方面，如果频谱大小，则语音识别结果可以更好仅噪声帧衰减。提出了一种基于帧丢弃所需的日志滤波器库幅度的语音活动检测（VAD）的新方法，或者用所谓的“宿醉标准”帧衰减。所有测试都是在斯洛文尼亚语，德语和西班牙固定电话SpeemDAT II数据库上进行的，使用HTK语音识别工具包。获得的结果表明，如果使用帧丢弃或帧衰减算法，则可以在少量高斯混合中实现小字错误率。

著录项

来源
《IEEE Region 8 EUROCON》|2003年||共4页
会议地点
作者
Damjan Vlaj; Bojan Kotnik; Zdravko Kacic; Bogomir Horvat;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN91-53;
关键词
Automatic speech recognition; Voice activity detection; Frame dropping; Frame attenuation;

机译：自动语音识别;语音活动检测;帧滴;帧衰减;

相似文献

外文文献
中文文献
专利

1. A comparative case study of neural network training by using frame-level cost functions for automatic speech recognition purposes in Spanish [J] . Aldonso Becerra, J. Ismael de la Rosa, Efren Gonzalez, Multimedia Tools and Applications . 2020,第27a28期

机译：用帧级成本函数在西班牙语中使用帧级成本函数来实现神经网络训练的比较案例研究
2. Training deep neural networks with non-uniform frame-level cost function for automatic speech recognition [J] . Becerra Aldonso, Ismael de la Rosa J., Gonzalez Efren, Multimedia Tools and Applications . 2018,第20期

机译：使用非均匀帧级代价函数训练深度神经网络以进行自动语音识别
3. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model [J] . He Di, Lim Boon Pang, Yang Xuesong, The Journal of the Acoustical Society of America . 2018,第6aPta1期

机译：声学地标包含与具有深度神经网络声学模型的自动语音识别的其他帧的更多信息
4. Usage of frame dropping and frame attenuation algorithms in automatic speech recognition systems [C] . Vlaj D., Kotnik B., Kaciv Z., EUROCON 2003. Computer as a Tool. The IEEE Region 8 . 2003

机译：丢帧和帧衰减算法在自动语音识别系统中的使用
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft-Decision in Digital Communication Systems [O] . Jiangyi Qin, Zhiping Huang, Chunwu Liu, -1

机译：数字通信系统中基于软判决的帧同步词盲识别新算法
7. Unified Frame and Segment Based Models for Automatic Speech Recognition [O] . 2008

机译：基于统一帧和段的自动语音识别模型
8. Effect of Reference Set Selection on Speaker Dependent Speech Recognition. Frame Compression in Isolated Word Recognition [R] . Li, Z., Alleva, F., Reddy, R. 1981

机译：参考集选择对说话人相关语音识别的影响。孤立词识别中的帧压缩

Usage of Frame Dropping and Frame Attenuation Algorithms in Automatic Speech Recognition Systems

摘要

著录项

相似文献

相关主题

期刊订阅