首页> 外文会议>European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.4; 20030901-04; Geneva(CH) >A New SVM Approach to Speaker Identification and Verification Using Probabilistic Distance Kernels
【24h】

A New SVM Approach to Speaker Identification and Verification Using Probabilistic Distance Kernels

机译:基于概率距离核的说话人识别和验证的SVM新方法

获取原文
获取原文并翻译 | 示例

摘要

One major SVM weakness has been the use of generic kernel functions to compute distances among data points. Polynomial, linear, and Gaussian are typical examples. They do not take full advantage of the inherent probability distributions of the data. Focusing on audio speaker identification and verification, we propose to explore the use of novel kernel functions that take full advantage of good probabilistic and descriptive models of audio data. We explore the use of generative speaker identification models such as Gaussian Mixture Models and derive a kernel distance based on the Kullback-Leibler (KL) divergence between generative models. In effect our approach combines the best of both generative and discriminative methods. Our results show that these new kernels perform as well as baseline GMM classifiers and outperform generic kernel based SVM's in both speaker identification and verification on two different audio databases.
机译:SVM的一个主要弱点是使用通用内核函数来计算数据点之间的距离。多项式,线性和高斯就是典型示例。他们没有充分利用数据的固有概率分布。我们着眼于音频说话者的识别和验证,建议探索利用新颖的内核功能,充分利用音频数据的良好概率和描述性模型。我们探索诸如高斯混合模型之类的生成说话人识别模型的使用,并基于生成模型之间的Kullback-Leibler(KL)散度推导内核距离。实际上,我们的方法结合了生成方法和判别方法的优点。我们的结果表明,在两个不同的音频数据库上的说话人识别和验证方面,这些新内核的性能与基线GMM分类器相同,并且优于基于通用内核的SVM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号