Chinese Conference on Biometric Recognition

Prioritized Grid Highway Long Short-Term Memory-Based Universal Background Model for Speaker Verification



Abstract

Prioritized grid long short-term memory (pGLSTM) has been shown to improve automatic speech recognition (ASR) efficiently. In this paper, we apply this state-of-the-art ASR model to text-independent Chinese speaker verification, adopting the DNN/i-Vector (DNN-based i-Vector) framework with a PLDA backend. To fully explore its performance, we compare the proposed pGLSTM-based UBM with the GMM-UBM and the HLSTM-UBM. Because the amount of transcribed Chinese corpus available for ASR training is limited, we also explore an adaptation method: we first train the pGLSTM-UBM on a large English corpus and then use a PLDA adaptation backend to fit the Chinese data before the final speaker verification scoring. Experiments show that both the pGLSTM-UBM with its corresponding PLDA backend and the pGLSTM-UBM with the adapted PLDA backend outperform the traditional GMM-UBM model. Moreover, the pGLSTM-UBM with PLDA backend achieves 4.94% EER on 5 s short utterances and 1.97% EER on 10 s short utterances, a relative reduction of 47% and 51% compared with the GMM baseline. The results imply that a DNN trained for ASR can extend the advantage of the UBM, especially on short utterances, and that a better ASR DNN could yield further gains in speaker verification.
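The abstract reports results as equal error rates (EER), the operating point where the false-acceptance rate equals the false-rejection rate. As an illustration only (this sketch is not from the paper; the function name and the threshold-sweep approach are our own), the metric can be computed from target and non-target verification scores like this:

```python
import numpy as np

def compute_eer(target_scores, nontarget_scores):
    """Equal error rate: sweep a decision threshold over the pooled
    scores and return the point where false-acceptance rate (FAR)
    and false-rejection rate (FRR) are closest."""
    scores = np.concatenate([target_scores, nontarget_scores])
    labels = np.concatenate([np.ones(len(target_scores)),
                             np.zeros(len(nontarget_scores))])
    order = np.argsort(scores)
    sorted_labels = labels[order]
    n_target = sorted_labels.sum()
    n_nontarget = len(sorted_labels) - n_target
    # Threshold at each sorted score: reject everything at or below it.
    frr = np.cumsum(sorted_labels) / n_target            # targets rejected
    far = 1.0 - np.cumsum(1 - sorted_labels) / n_nontarget  # non-targets accepted
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0
```

With perfectly separated scores the EER is 0; with fully interleaved score distributions it approaches 0.5. The 4.94% and 1.97% figures above come from applying this kind of measurement to PLDA scores on 5 s and 10 s trials.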


