...
首页> 外文期刊>Pattern Analysis and Applications >Voicedon-voiced speech classification using adaptive thresholding with bivariate EMD
【24h】

Voicedon-voiced speech classification using adaptive thresholding with bivariate EMD

机译:使用带有双变量EMD的自适应阈值进行语音/非语音语音分类

获取原文
获取原文并翻译 | 示例

摘要

This paper introduces a robust voicedon-voiced (VnV) speech classification method using bivariate empirical mode decomposition (bEMD). Fractional Gaussian noise (fGn) is employed as the reference signal to derive a data adaptive threshold for VnV discrimination. The analyzing speech signal and fGn are combined to generate a complex signal which is decomposed into a finite number of complex-valued intrinsic mode functions (IMFs) by using bEMD. The real and imaginary parts of the IMFs represent the IMFs of observed speech and fGn, respectively. The log-energies of both types of IMFs are calculated. There exist similarities between the IMF log-energy representation of fGn and unvoiced speech signals. Hence, the upper confidence limit from IMF log-energies of fGn is used as data adaptive threshold for VnV classification. If the subband log-energy of speech segment exceeds the threshold, the segment is classified as voiced and unvoiced otherwise. The experimental results show that the proposed algorithm performs better than the recently reported methods without requiring any training data for a wide range of SNRs.
机译:本文介绍了一种使用双变量经验模式分解(bEMD)的鲁棒的有声/无声(VnV)语音分类方法。分数高斯噪声(fGn)被用作参考信号,以得出用于VnV鉴别的数据自适应阈值。将分析语音信号和fGn组合以生成一个复信号,该复信号通过使用bEMD分解为有限数量的复值固有模式函数(IMF)。 IMF的实部和虚部分别代表观察到的语音和fGn的IMF。计算两种类型的IMF的对数能量。 fGn的IMF对数能量表示与清语音信号之间存在相似之处。因此,来自fGn的IMF对数能量的置信上限被用作VnV分类的数据自适应阈值。如果语音段的子带对数能量超过阈值,则将该段分类为有声和无声。实验结果表明,该算法在较宽的信噪比范围内不需要任何训练数据,其性能优于最近报道的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号