Prior Information for Rapid Speaker Adaptation

机译：快速适应说话者的先决信息

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Rapidly adapting a speech recognition system to new speakers using a small amount of adaptation data is important to improve initial user experience. In this paper, a count-smoothing framework for incorporating prior information is extended to allow for the use of different forms of dynamic prior and improve the robustness of transform estimation on small amounts of data. Prior information is obtained from existing rapid adaptation techniques like VTLN and PCMLLR. Results using VTLN as a dynamic prior for CMLLR estimation show that transforms estimated on just one utterance can yield relative gains of 15% and 46% over a baseline gender independent model on two tasks.

机译：使用少量的适应性数据将语音识别系统快速适应新的说话者，对于改善初始用户体验非常重要。在本文中，扩展了用于合并先验信息的计数平滑框架，以允许使用不同形式的动态先验，并提高了对少量数据的变换估计的鲁棒性。从现有的快速适应技术（如VTLN和PCMLLR）中获取先验信息。使用VTLN作为CMLLR估计的动态先验的结果表明，仅凭一种话语估计的变换就可以在两项任务的基准性别无关模型上产生15％和46％的相对收益。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.1644-1647|共4页
会议地点
作者
C. Breslin; K.K. Chin; M.J.F. Gales; K. Knill H. Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
automatic speech recognition; speaker adaptation; VTLN; prior knowledge;

机译：自动语音识别;说话人适应VTLN;先验知识;

相似文献

外文文献
中文文献
专利

1. Rapid Speaker Adaptation Based on Combination of KPCA and Latent Variable Model [J] . Ansari Zohreh, Almasganj Farshad, Kabudian Seyed Jahanshah Circuits, systems and signal processing . 2021,第8期

机译：基于KPCA和潜变模型的组合快速扬声器适应
2. Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction [J] . Dong-Jin Choi, Jeong-Sik Park, Yung-Hwan Oh Engineering Applications of Artificial Intelligence . 2015,第apra期

机译：基于选择性特征语音合并的无监督快速说话人适应，用于特定于用户的语音交互
3. Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data [J] . Tang Y., Rose R. IEEE transactions on audio, speech and language processing . 2008,第3期

机译：使用具有稀疏训练数据的聚类最大似然线性基础快速进行说话人适应
4. Prior Information for Rapid Speaker Adaptation [C] . C. Breslin, K.K. Chin, M.J.F. Gales, Annual conference of the International Speech Communication Association . 2010

机译：快速扬声器适应的事先信息
5. Rapid Speaker Normalization and Adaptation with Applications to Automatic Evaluation of Children's Language Learning Skills. [D] . Wang, Shizhen. 2010

机译：快速的说话人归一化和适应，并应用于儿童语言学习技能的自动评估。
6. Control of Movement: Control of the strength of visual-motor transmission as the mechanism of rapid adaptation of priors for Bayesian inference in smooth pursuit eye movements [O] . Timothy R. Darlington, Stefanie Tokiyama, Stephen G. Lisberger -1

机译：运动控制：视觉运动传递强度的控制作为快速顺畅地跟踪眼睛运动的贝叶斯推理先验先验的机制
7. Rapid speaker adaptation in latent speaker space with non-negative matrix factorization [O] . Zhang Xueru, Demuynck Kris, Van hamme Hugo 2013

机译：带有非负矩阵分解的潜在说话人空间中的快速说话人适应

Prior Information for Rapid Speaker Adaptation

摘要

著录项

相似文献

相关主题

期刊订阅