Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics

Randy GOMEZ; Tomoki TODA; Hiroshi SARUWATARI; Kiyohiro SHIKANO

首页> 外文期刊>IEICE Transactions on Information and Systems >Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics

【24h】

Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics

机译：基于HMM足够统计量的快速无监督说话人自适应计算时间的减少

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In real-time speech recognition applications, there is a need to implement a fast and reliable adaptation algorithm. We propose a method to reduce adaptation time of the rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. We use only a single arbitrary utterance without transcriptions in selecting the N-best speakers' Sufficient Statistics created offline to provide data for adaptation to a target speaker. Further reduction of N-best implies a reduction in adaptation time. However, it degrades recognition performance due to insufficiency of data needed to robustly adapt the model. Linear interpolation of the global HMM-Sufficient Statistics offsets this negative effect and achieves a 50% reduction in adaptation time without compromising the recognition performance. Furthermore, we compared our method with Vocal Tract Length Normalization (VTLN), Maximum A Posteriori (MAP) and Maximum Likelihood Linear Regression (MLLR). Moreover, we tested in office, car, crowd and booth noise environments in 10 dB, 15 dB, 20 dB and 25 dB SNRs.

机译：在实时语音识别应用中，需要实现一种快速而可靠的自适应算法。我们提出了一种基于HMM充足统计量的快速减少无监督说话人自适应时间的方法。在选择N个最佳演讲者离线创建的充足统计信息时，我们仅使用一个没有转录的任意话语来提供适应目标演讲者的数据。 N-best的进一步减少意味着适应时间的减少。但是，由于健壮地适应模型所需的数据不足，它降低了识别性能。全局HMM充足统计信息的线性插值抵消了这种负面影响，并在不影响识别性能的情况下将自适应时间减少了50％。此外，我们将我们的方法与声带长度归一化（VTLN），最大后验概率（MAP）和最大似然线性回归（MLLR）进行了比较。此外，我们在办公室，汽车，人群和展位的噪声环境中分别测试了10 dB，15 dB，20 dB和25 dB的SNR。

著录项

来源
《IEICE Transactions on Information and Systems》 |2007年第2期|p.554-561|共8页
作者
Randy GOMEZ; Tomoki TODA; Hiroshi SARUWATARI; Kiyohiro SHIKANO;
展开▼
作者单位

Nara Institute of Science and Technology, Ikoma-shi, 630-0192 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
HMM-sufficient statistics; unsupervised; rapid adaptation; speech recognition;

机译：HMM足够的统计数据;无监督;快速适应;语音识别;

相似文献

外文文献
中文文献
专利

1. Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models [J] . Randy GOMEZ, Akinobu LEE, Tomoki TODA, IEICE Transactions on Information and Systems . 2006,第3期

机译：使用多模板模型在嘈杂环境中提高基于HMM足够统计量的快速无监督说话人适应
2. Evaluating Rapid Unsupervised Speaker Adaptation Using Linear Interpolation of HMM-Sufficient Statistics [J] . Randy GOMEZ, Tomoki TODA, Hiroshi SARUWATAR, 電子情報通信学会技術研究報告. 音声. Speech . 2005,第495期

机译：使用HMM足够统计量的线性插值评估快速无监督说话人适应
3. Evaluating Rapid Unsupervised Speaker Adaptation Using Linear Interpolation of HMM-Sufficient Statistics [J] . Randy GOMEZ, Tomoki TODA, Hiroshi SARUWATAR, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2005,第493期

机译：使用HMM足够统计量的线性插值评估快速无监督说话人适应
4. Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers [C] . Yoshizawa, S., Baba, . 2001

机译：基于选定说话者的足够HMM统计信息的无监督说话者适应
5. Rapid Speaker Normalization and Adaptation with Applications to Automatic Evaluation of Children's Language Learning Skills. [D] . Wang, Shizhen. 2010

机译：快速的说话人归一化和适应，并应用于儿童语言学习技能的自动评估。
6. Rapid identification of BCR/ABL1-like acute lymphoblastic leukaemia patients using a predictive statistical model based on quantitative real time-polymerase chain reaction: clinical prognostic and therapeutic implications [O] . Sabina Chiaretti, Monica Messina, Sara Grammatico, -1

机译：使用基于定量实时聚合酶链反应的预测统计模型快速识别BCR / ABL1样急性淋巴细胞白血病患者：临床预后和治疗意义
7. Improving Rapid Unsupervised Speaker Adaptation Based on HMM Sufficient Statistics [O] . Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, 2006

机译：基于HMM足够统计量的快速无监督说话人自适应

Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅