首页> 外文会议>Recent researches in automatic control, systems science and communications >Efficient Non-linear Changed Mel-filter Bank VAD Algorithm
【24h】

Efficient Non-linear Changed Mel-filter Bank VAD Algorithm

机译:高效的非线性变化梅尔滤波器组VAD算法

获取原文
获取原文并翻译 | 示例

摘要

This paper introduces efficient non-linear changed mel-filter bank (MFB) voice activity detection (VAD) algorithm. Non-linear changed mel-filter bank outputs improve detection of parts in the speech signal, where vowels, diphthongs and semivowels are present. To make voice activity detection of consonants in the speech signal as good as possible, the hangover and hangbefore criteria are used. For this reason the phoneme duration analysis was made. The duration of vowels, diphthongs and semivowels defines how many frames must be detected as speech, so that it can be decided if hangover and hangbefore criteria will be used at all. The duration of consonants defines how many frames will be used for hangover and hangbefore criteria. Comparative tests were made between the MFB VAD algorithm, where non-linear function was used and where it was not used. The experiments were also made on four VAD algorithms used in the ITU G.729, ITU G.723.1, DSR ETSI ES 202 050, and DSR ETSI ES 202 211 standards. The introduction of non-linear function in to the MFB VAD algorithm reduces errors obtained by incorrect voice activity detection.
机译:本文介绍了一种有效的非线性可变梅尔滤波器组(MFB)语音活动检测(VAD)算法。非线性变化的梅尔滤波器组输出改善了语音信号中存在元音,双音和半元音的部分的检测。为了使语音信号中辅音的语音活动检测尽可能好,使用了宿醉和hangbefore标准。因此,进行了音素持续时间分析。元音,二元音和半元音的持续时间定义了必须将多少帧检测为语音,以便可以确定是否完全使用宿醉和hangbefore标准。辅音的持续时间定义了将多少帧用于宿醉和hangbefore标准。在使用非线性函数和不使用非线性函数的MFB VAD算法之间进行了比较测试。还针对ITU G.729,ITU G.723.1,DSR ETSI ES 202 050和DSR ETSI ES 202 211标准中使用的四种VAD算法进行了实验。在MFB VAD算法中引入非线性函数可以减少由于语音活动检测错误而导致的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号