Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better

John H.L. Hansen; Jun-Won Suh; Pongtep Angkititrakul; Yun Lei

首页> 外文期刊>International journal of speech technology >Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better

【24h】

Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better

机译：在看不见的测试环境中进行有效的背景数据选择，以实现基于SVM的说话人识别：更多并不总是更好

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study focuses on formulating a procedure to select effective negative examples for the development of improved Support Vector Machine (SVM)-based speaker recognition. Selection of a background dataset, or a collection of negative examples, is the crucial step for building an effective decision surface between a target speaker and the non-target speakers. Previous studies heuristically fixed the number of negative examples used based on available development data for performance evaluation; nevertheless, in real applications this does not guarantee sustained performance for unseen data, as will be shown. In the proposed model selection framework, a novel ranking method is first exploited to rank order the negative examples for selecting a set of background datasets with various population sizes. Next, an error estimation and model-selection criterion are proposed and employed to select the most suitable target model among the model candidates. The experimental validation, conducted on the NIST SRE-2008 and SRE-2010 data, demonstrates that the proposed background data selection slightly but consistently outperforms the fixed-size background data selection, and achieves a relative improvement of +6 % over the non-selection background framework in terms of minDCF.

机译：这项研究的重点是制定程序，以选择有效的否定示例，以开发基于改进的支持向量机（SVM）的说话人识别。选择背景数据集或否定示例的集合，是在目标讲话者与非目标讲话者之间建立有效决策面的关键步骤。先前的研究启发式地根据可用的开发数据来评估绩效评估中使用的负面案例的数量；但是，在实际应用中，这并不能保证对看不见的数据具有持续的性能，如下所示。在提出的模型选择框架中，首先利用一种新颖的排序方法对负样本进行排序，以选择一组具有各种人口规模的背景数据集。接下来，提出了误差估计和模型选择准则，并采用该准则来在模型候选者中选择最合适的目标模型。对NIST SRE-2008和SRE-2010数据进行的实验验证表明，建议的背景数据选择略微但始终优于固定大小的背景数据选择，并且相对于非选择而言，相对提高了6％ minDCF的背景框架。

著录项

来源
《International journal of speech technology》 |2014年第3期|211-221|共11页
作者
John H.L. Hansen; Jun-Won Suh; Pongtep Angkititrakul; Yun Lei;
展开▼
作者单位

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Speaker recognition; Support vector machine (SVM); NIST SRE; Robustness in speaker ID;

机译：说话人识别;支持向量机（SVM）;NIST SRE;扬声器ID的稳健性;

相似文献

外文文献
中文文献
专利

1. Data-Driven Background Dataset Selection for SVM-Based Speaker Verification [J] . McLaren M., Vogt R., Baker B., Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第6期

机译：数据驱动的背景数据集选择，用于基于SVM的扬声器验证
2. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition [J] . Ferras M., Cheung-Chi Leung, Barras C., Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第6期

机译：基于SVM的说话人识别中说话人自适应方法作为特征提取的比较
3. Specificity data for the b Test, Dot Counting Test, Rey-15 Item Plus Recognition, and Rey Word Recognition Test in monolingual Spanish-speakers [J] . Robles Luz, Lopez Enrique, Salazar Xavier, Journal of clinical and experimental neuropsychology . 2015,第5a6期

机译：b语言测试，点计数测试，Rey-15项目加识别和Rey单词识别测试在西班牙语中的特异性数据
4. Effective background data selection in SVM speaker recognition for unseen test environment: More is not always better [C] . Suh Jun-Won, Lei Yun, Kim Wooil, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：在SVM说话人识别中有效的背景数据选择，适用于看不见的测试环境：并非总是更好
5. Effective data selection technology for robust speaker recognition. [D] . Suh, Jun-Won. 2012

机译：有效的数据选择技术可实现可靠的说话人识别。
6. Automatic Detection of Previously-Unseen Application States for Deployment Environment Testing and Analysis [O] . Christian Murphy, Moses Vaughan, Waseem Ilahi, -1

机译：自动检测以前的未经过申请国的部署环境测试和分析
7. Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better [O] . John H. L. Hansen, Jun-Won Suh, Pongtep Angkititrakul, 2014

机译：在看不见的测试环境中进行有效的背景数据选择，以实现基于SVM的说话人识别：更多并不总是更好
8. Trial-Based Calibration for Speaker Recognition in Unseen Conditions. [R] . McLaren, M., Lawson, A., Ferrer, L., 2014

机译：不可见条件下说话人识别的基于试验的校准。

Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better

摘要

著录项

相似文献

相关主题

期刊订阅