首页> 外文会议>International Conference on speech and computer >Discriminating Speakers by Their Voices - A Fusion Based Approach
【24h】

Discriminating Speakers by Their Voices - A Fusion Based Approach

机译:通过语音区分说话者-一种基于融合的方法

获取原文

摘要

The task of Speaker Discrimination (SD) consists in checking whether two speech segments belong to the same speaker or not. In this research field, it is often difficult to decide what could be the best classifier in terms of accuracy and robustness. For that purpose, we have implemented 9 classifiers: Support Vector Machines, Linear Discriminant Analysis, Multi-Layer Percep-tron, Generalized Linear Model, Self Organizing Map, Adaboost, Second Order Statistical Measures, Linear Regression and Gaussian Mixture Models. Furthermore, a new fusion approach is proposed and experimented in speaker discrimination. Several experiments of speaker discrimination were conducted on Hub4 Broadcast-News with relatively short segments. The obtained results have shown that the best classifier is the SVM and that the proposed fusion approach is quite interesting since it provided the best performances at all.
机译:说话者辨别(SD)的任务在于检查两个语音片段是否属于同一说话者。在这个研究领域中,通常难以确定在准确性和鲁棒性方面什么是最佳分类器。为此,我们实现了9个分类器:支持向量机,线性判别分析,多层Percep-tron,广义线性模型,自组织映射,Adaboost,二阶统计量度,线性回归和高斯混合模型。此外,提出了一种新的融合方法,并在说话人辨别中进行了实验。在Hub4 Broadcast-News上以相对较短的片段进行了一些说话人辨别实验。获得的结果表明,最好的分类器是SVM,并且所提出的融合方法非常有趣,因为它可以提供最佳的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号