...
首页> 外文期刊>International journal of speech technology >Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better
【24h】

Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better

机译:在看不见的测试环境中进行有效的背景数据选择,以实现基于SVM的说话人识别:更多并不总是更好

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This study focuses on formulating a procedure to select effective negative examples for the development of improved Support Vector Machine (SVM)-based speaker recognition. Selection of a background dataset, or a collection of negative examples, is the crucial step for building an effective decision surface between a target speaker and the non-target speakers. Previous studies heuristically fixed the number of negative examples used based on available development data for performance evaluation; nevertheless, in real applications this does not guarantee sustained performance for unseen data, as will be shown. In the proposed model selection framework, a novel ranking method is first exploited to rank order the negative examples for selecting a set of background datasets with various population sizes. Next, an error estimation and model-selection criterion are proposed and employed to select the most suitable target model among the model candidates. The experimental validation, conducted on the NIST SRE-2008 and SRE-2010 data, demonstrates that the proposed background data selection slightly but consistently outperforms the fixed-size background data selection, and achieves a relative improvement of +6 % over the non-selection background framework in terms of minDCF.
机译:这项研究的重点是制定程序,以选择有效的否定示例,以开发基于改进的支持向量机(SVM)的说话人识别。选择背景数据集或否定示例的集合,是在目标讲话者与非目标讲话者之间建立有效决策面的关键步骤。先前的研究启发式地根据可用的开发数据来评估绩效评估中使用的负面案例的数量;但是,在实际应用中,这并不能保证对看不见的数据具有持续的性能,如下所示。在提出的模型选择框架中,首先利用一种新颖的排序方法对负样本进行排序,以选择一组具有各种人口规模的背景数据集。接下来,提出了误差估计和模型选择准则,并采用该准则来在模型候选者中选择最合适的目标模型。对NIST SRE-2008和SRE-2010数据进行的实验验证表明,建议的背景数据选择略微但始终优于固定大小的背景数据选择,并且相对于非选择而言,相对提高了6% minDCF的背景框架。

著录项

  • 来源
    《International journal of speech technology》 |2014年第3期|211-221|共11页
  • 作者单位

    Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

    Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

    Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

    Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Speaker recognition; Support vector machine (SVM); NIST SRE; Robustness in speaker ID;

    机译:说话人识别;支持向量机(SVM);NIST SRE;扬声器ID的稳健性;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号