首页> 外文OA文献 >Combining pulse-based features for rejecting far-field speech in a HMM-based Voice Activity Detector. Computers Electrical Engineering (CAEE).
【2h】

Combining pulse-based features for rejecting far-field speech in a HMM-based Voice Activity Detector. Computers Electrical Engineering (CAEE).

机译:结合基于脉冲的功能,以在基于HMM的语音活动检测器中拒绝远场语音。计算机与电气工程(CAEE)。

摘要

Nowadays, several computational techniques for speech recognition have been proposed. These techniques suppose an important improvement in real time applications where speaker interacts with speech recognition systems. Although researchers proposed many methods, none of them solve the high false alarm problem when far-field speakers interfere in a human-machine conversation. This paper presents a two-class (speech and non-speech classes) decision-tree based approach for combining new speech pulse features in a VAD (Voice Activity Detector) for rejecting far-field speech in speech recognition systems. This Decision Tree is applied over the speech pulses obtained by a baseline VAD composed of a frame feature extractor, a HMM-based (Hidden Markov Model) segmentation module and a pulse detector. The paper also presents a detailed analysis of a great amount of features for discriminating between close and far-field speech. The detection error obtained with the proposed VAD is the lowest compared to other well-known VADs
机译:如今,已经提出了几种用于语音识别的计算技术。这些技术假定在说话者与语音识别系统交互的实时应用中有重要的改进。尽管研究人员提出了许多方法,但是当远场发言者干扰人机对话时,它们都无法解决高误报问题。本文提出了一种基于两类(语音和非语音类)决策树的方法,用于在VAD(语音活动检测器)中组合新的语音脉冲特征以拒绝语音识别系统中的远场语音。该决策树应用于由基线VAD获得的语音脉冲,基线VAD由帧特征提取器,基于HMM的(隐马尔可夫模型)分割模块和脉冲检测器组成。本文还提供了用于区分近场和远场语音的大量功能的详细分析。与其他知名的VAD相比,通过建议的VAD获得的检测误差最低

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号