A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification

Wei Wu; Zheng T.F.; Ming-Xing Xu; Soong F.K.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification

【24h】

A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification

机译：基于队列的说话人验证中不匹配通道的说话人模型综合

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Mismatch between enrollment and test data is one of the top performance degrading factors in speaker recognition applications. This mismatch is particularly true over public telephone networks, where input speech data is collected over different handsets and transmitted over different channels from one trial to the next. In this paper, a cohort-based speaker model synthesis (SMS) algorithm, designed for synthesizing robust speaker models without requiring channel-specific enrollment data, is proposed. This algorithm utilizes a priori knowledge of channels extracted from speaker-specific cohort sets to synthesize such speaker models. The cohort selection in the proposed new SMS can be either speaker-specific or Gaussian component based. Results on the China Criminal Police College (CCPC) speaker recognition corpus, which contains utterances from both landline and mobile channel, show the new algorithms yield significant speaker verification performance improvement over Htnorm and universal background model (UBM)-based speaker model synthesis.

机译：注册和测试数据之间的不匹配是说话人识别应用程序中最严重的性能下降因素之一。这种失配在公用电话网络上尤其如此，在公用电话网络上，输入的语音数据是通过不同的手机收集的，并通过不同的通道从一个试验传送到下一个试验。本文提出了一种基于队列的说话人模型合成（SMS）算法，该算法旨在用于合成鲁棒的说话人模型而无需特定于频道的注册数据。该算法利用从说话者特定队列集合中提取的声道的先验知识来合成这种说话者模型。建议的新SMS中的同类群组选择可以是特定于说话者的，也可以是基于高斯分量的。中国刑警学院（CCPC）说话人识别语料库的结果包含固定电话和移动渠道的语音，显示出新算法比基于Htnorm和基于通用背景模型（UBM）的说话人模型综合在说话人验证性能上有显着提高。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2007年第6期|p.1893-1903|共11页
作者
Wei Wu; Zheng T.F.; Ming-Xing Xu; Soong F.K.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Gaussian processes; speaker recognition; speech synthesis; telephone networks; Gaussian component; cohort-based speaker model synthesis; mismatched channels; public telephone networks; speaker recognition; speaker verification; Channel mismatch; cohort; speaker mode;

机译：高斯过程;说话人识别;语音合成;电话网络;高斯分量;基于群组的说话人模型综合;信道不匹配;公用电话网络;说话人识别;说话人验证;信道不匹配;群组;说话人模式;

相似文献

外文文献
中文文献
专利

1. Joint Factor Analysis of Channel Mismatch in Whispering Speaker Verification [J] . Gang LV, Heming ZHAO Archives of acoustics . 2012,第4期

机译：说话者验证中通道不匹配的联合因素分析
2. Towards an Optimal Speaker Modeling in Speaker Verification Systems using Personalized Background Models [J] . Ayoub Bouziane, Jamal Kharroubi, Arsalane Zarghili International Journal of Electrical and Computer Engineering . 2017,第6期

机译：使用个性化背景模型实现说话人验证系统中的最佳说话人建模
3. Speaker Model Clustering to Construct Background Models for Speaker Verification [J] . Disken Gokay, Tufekci Zekeriya, Cevik Ulus Archives of acoustics . 2017,第1期

机译：说话人模型聚类为说话人验证构建背景模型
4. Modelling speaker and channel variability using deep neural networks for robust speaker verification [C] . Gautam Bhattacharya, Jahangir Alam, Patrick Kenn, IEEE Workshop on Spoken Language Technology . 2016

机译：使用深度神经网络对说话人和频道可变性进行建模，以进行可靠的说话人验证
5. Discriminative and generative approaches for long- and short-term speaker characteristics modeling: Application to speaker verification. [D] . Dehak, Najim. 2009

机译：长期和短期说话者特征建模的判别和生成方法：在说话者验证中的应用。
6. Short-time speaker verification with different speaking style utterances [O] . Hongwei Mao, Yan Shi, Yue Liu, 2020

机译：短时间发言者验证不同的说话风格的话语
7. EFFECTS OF DEVICE MISMATCH, LANGUAGE MISMATCH AND ENVIRONMENTAL MISMATCH ON SPEAKER VERIFICATION [O] . Bin Ma, Helen M. Meng, Man-wai Mak 2009

机译：设备不匹配，语音失调和环境失调对语音验证的影响
8. Speaker Verification in the Presence of Channel Mismatch Using Gaussian MixtureModels [R] . Reid, R. B. 1997

机译：使用高斯混合模型进行通道不匹配时的说话人验证

A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅