首页> 外文会议>International Joint Conference on Neural Networks >Hybrid SVM/HMM architectures for statistical model-based voice activity detection
【24h】

Hybrid SVM/HMM architectures for statistical model-based voice activity detection

机译:混合SVM / HMM架构,用于基于统计模型的语音活动检测

获取原文

摘要

This research was supported in part by the China National Nature Science Foundation (No.91120303, No.61273267, No.90820011 and No.90820303). The decision function of support vector machine (SVM) using the likelihood ratios (LRs) is successfully used for statistical model-based voice activity detection (VAD). It is known to incorporate an optimised nonlinear decision over two different classes, instead of comparing the geometric mean of the LRs for the individual frequency bands with a given threshold for speech detection. However, the inter-frame correlation of the voice activity is not taken into consideration. In this paper, we explore a hybrid SVM/hidden Markov model (HMM) approach for the VAD, which retains discriminative and nonlinear properties of SVM, while modeling the inter-frame correlation powerfully through a first-order HMM. Experimental results show the significant improvement of the performance of the proposed VAD in comparison with the S VM-based VAD.
机译:该研究得到了中国国家自然科学基金(91120303、61273267、90820011和90820303)的部分支持。支持向量机(SVM)使用似然比(LR)的决策功能已成功用于基于统计模型的语音活动检测(VAD)。已知在两个不同的类别上合并优化的非线性决策,而不是将各个频带的LR的几何平均值与给定的语音检测阈值进行比较。但是,没有考虑语音活动的帧间相关性。在本文中,我们探索了一种用于VAD的混合SVM /隐马尔可夫模型(HMM)方法,该方法保留了SVM的判别和非线性属性,同时通过一阶HMM对帧间相关性进行了有效建模。实验结果表明,与基于SVM的VAD相比,该VAD的性能有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号