...
首页> 外文期刊>Pattern recognition letters >Segmentation of specific speech signals from multi-dialog environment using SVM and wavelet
【24h】

Segmentation of specific speech signals from multi-dialog environment using SVM and wavelet

机译:使用SVM和小波对来自多对话环境的特定语音信号进行分割

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a novel multi-speaker segmentation technique is presented. This technique makes use of wavelets and support vector machines (SVMs) to segment specific speakers' speech signals from multi-dialog environments. The proposed method first applies wavelets to determine the acoustical features such as subband power and pitch information from a given multi-dialog speech data. Then the multi-speaker segmentation of the given multi-dialog speech data can be accomplished by the use of a bottom-up SVM over these acoustical features and additional parameters, such as frequency cepstral coefficients. A public audio database, Aurora-2, is used to evaluate the performances of the proposed method. Experimental results show that the accuracy of multi-speaker segmentation is 100% achieved in the combination of two speakers. And the segmental accuracy can achieve at least 94.12% and 85.93% for 4-speaker and 8-speaker conditions, respectively.
机译:在本文中,提出了一种新颖的多扬声器分割技术。该技术利用小波和支持向量机(SVM)来分割来自多对话环境的特定扬声器的语音信号。所提出的方法首先应用小波从给定的多对话语音数据中确定声学特征,例如子带功率和音调信息。然后,可以通过在这些声学特征和其他参数(例如倒频谱系数)上使用自底向上的SVM来完成给定多对话语音数据的多扬声器分割。公共音频数据库Aurora-2用于评估该方法的性能。实验结果表明,在两个扬声器组合的情况下,多扬声器分割的准确性达到了100%。在4扬声器和8扬声器条件下,分段精度分别至少达到94.12%和85.93%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号