A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

机译：关于组合GMM和DNN框架对扬声器适应的新视角

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we investigate the GMM-derived features for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models with the focus on exploration of fusion of the adapted GMM-derived features and the conventional bottleneck features. We analyze and compare different types of fusion, such as feature level, posterior level, lattice level and others in order to discover the best possible way of fusion. Experimental results on the TED-LIUM corpus show that the proposed adaptation technique can be effectively integrated into DNN setup at different levels and provide additional gain in recognition performance: up to 6% of relative word error rate reduction (WERR) over the strong speaker adapted DNN baseline, and up to 22% of relative WERR in comparison with a speaker independent DNN baseline model, trained on conventional features.

机译：在本文中，我们研究了GMM导出的特征，用于改编上下文的深度神经网络嗯（CD-DNN-HMM）声学模型，重点是探索适用的GMM导出的特征和传统瓶颈特征的融合。我们分析和比较不同类型的融合，如特征级，后水平，晶格水平等，以发现最佳的融合方式。 TED-Lium语料库上的实验结果表明，所提出的适应技术可以在不同级别中有效地集成到DNN设置中，并提供识别性能的额外增益：在强大的扬声器上，高达6％的相对字错误率减少（WERR）适应与扬声器独立的DNN基线模型相比，DNN基线，相对WER的高达22％，接受常规特征培训。

著录项

来源
《International Conference on Statistical Language and Speech Processing》|2016年|144p|共13页
会议地点
作者
Natalia Tomashenko; Yuri Khokhlov; Yannick Esteve;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
Speaker adaptation; Deep neural networks (DNN); MAP; FMLLR; CD-DNN-HMM; GMM-derived (GMMD) features; Fusion; Posterior fusion; Confusion network combination;

机译：扬声器适应;深神经网络（DNN）;地图;FMLLR;CD-DNN-HMM;GMM衍生的（GMMD）特征;融合;后融合;混乱网络组合;
入库时间 2022-08-20 23:00:27

相似文献

外文文献
中文文献
专利

1. Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM [J] . Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa Speech Communication . 2007,第6期

机译：通过结合特定于说话人的GMM和适用于说话人的HMM，基于位置相关的CMN进行鲁棒的远方说话人识别
2. Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM [J] . Seiichi NAKAGAWA, Wei ZHANG, Mitsuo TAKAHASHI IEICE Transactions on Information and Systems . 2006,第3期

机译：通过结合特定于说话人的GMM和基于说话人的基于音节的HMM来实现与文本无关/提示文字的说话人识别
3. Speaker identification by combining speaker specific GMM with speaker adapted syllable-based HMM [J] . Seiichi Nakagawa, Wei Zhang 電子情報通信学会技術研究報告. 音声. Speech . 2003,第94期

机译：通过将特定于说话人的GMM与基于说话人的基于音节的HMM相结合来进行说话人识别
4. A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation [C] . Natalia Tomashenko, Yuri Khokhlov, Yannick Esteve International conference on statistical language and speech processing . 2016

机译：结合GMM和DNN框架进行说话人适应的新观点
5. DNN Based Speaker Recognition System [D] . Song Hangyu 2020

机译：基于DNN的说话人识别系统
6. Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors [O] . Daniel Bone, Ming Li, Matthew P. Black, -1

机译：陶醉的语音检测：具有扬声器归一化分层功能和GMM运行的融合框架
7. Probabilistic Neural Networks Combined With Gmms For Speaker Recognition Over Telephone Channels [O] . Todor Ganchev, Anastasios Tsopanoglou, Nikos Fakotakis, 2002

机译：结合Gmms的概率神经网络用于电话通道上的说话人识别

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

摘要

著录项

相似文献

相关主题

期刊订阅