首页> 外文会议>International Conference on Speech and Computer >Discriminating Speakers by Their Voices - A Fusion Based Approach
【24h】

Discriminating Speakers by Their Voices - A Fusion Based Approach

机译:通过他们的声音来辨别发言者 - 一种基于融合的方法

获取原文

摘要

The task of Speaker Discrimination (SD) consists in checking whether two speech segments belong to the same speaker or not. In this research field, it is often difficult to decide what could be the best classifier in terms of accuracy and robustness. For that purpose, we have implemented 9 classifiers: Support Vector Machines, Linear Discriminant Analysis, Multi-Layer Perceptron, Generalized Linear Model, Self Organizing Map, Adaboost, Second Order Statistical Measures, Linear Regression and Gaussian Mixture Models. Furthermore, a new fusion approach is proposed and experimented in speaker discrimination. Several experiments of speaker discrimination were conducted on Hub4 Broadcast-News with relatively short segments. The obtained results have shown that the best classifier is the SVM and that the proposed fusion approach is quite interesting since it provided the best performances at all.
机译:发言者歧视(SD)的任务包括检查两个语音段是否属于同一扬声器。在这一研究领域,通常很难在准确性和稳健性方面决定最佳分类器。为此目的,我们已经实施了9分类器:支持向量机,线性判别分析,多层判定,广义线性模型,自组织地图,Adaboost,二阶统计测量,线性回归和高斯混合模型。此外,提出了一种新的融合方法,并在发言者歧视中进行了实验。在Hub4广播新闻中进行了几次发言人歧视 - 具有相对短暂的细分市场。所获得的结果表明,最好的分类器是SVM,并且所提出的融合方法非常有趣,因为它完全提供了最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号