...
首页> 外文期刊>Computers and Electrical Engineering >Combining pulse-based features for rejecting far-field speech in aHMM-based Voice Activity Detector
【24h】

Combining pulse-based features for rejecting far-field speech in aHMM-based Voice Activity Detector

机译:结合基于脉冲的功能以在基于HHMM的语音活动检测器中拒绝远场语音

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Nowadays, several computational techniques for speech recognition have been proposed. These techniques suppose an important improvement in real time applications where speaker interacts with speech recognition systems. Although researchers proposed many methods, none of them solve the high false alarm problem when far-field speakers interfere in a human-machine conversation. This paper presents a two-class (speech and non-speech classes) decision-tree based approach for combining new speech pulse features in a VAD (Voice Activity Detector) for rejecting far-field speech in speech recognition systems. This Decision Tree is applied over the speech pulses obtained by a baseline VAD composed of a frame feature extractor, a HMM-based (Hidden Markov Model) segmentation module and a pulse detector. The paper also presents a detailed analysis of a great amount of features for discriminating between close and far-field speech. The detection error obtained with the proposed VAD is the lowest compared to other well-known VADs.
机译:如今,已经提出了几种用于语音识别的计算技术。这些技术假定在说话者与语音识别系统交互的实时应用中有重要的改进。尽管研究人员提出了许多方法,但是当远场发言者干扰人机对话时,它们都无法解决高误报问题。本文提出了一种基于两类(语音和非语音类)决策树的方法,在VAD(语音活动检测器)中组合新的语音脉冲特征以拒绝语音识别系统中的远场语音。该决策树应用于由基线VAD获得的语音脉冲,该基线VAD由帧特征提取器,基于HMM的(隐马尔可夫模型)分割模块和脉冲检测器组成。本文还对用于区分近场和远场语音的大量功能进行了详细分析。与其他众所周知的VAD相比,使用建议的VAD获得的检测误差最低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号