Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

机译：快速扬声器适应使用扬声器 - 混合的allophone模型应用于扬声器的语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A speaker mixture principle that allows the creation of speaker-independent phone models is proposed. Speaker-tied training for rapid speaker adaptation using utterances shorter than one second is derived from this principle. The concept of speaker pruning is also introduced for reducing computational cost without degrading the speaker adaptation performance. The above principle is combined with context-dependent phone models, which have been automatically generated by the successive state splitting algorithm. In a Japanese phrase recognition experiment, speaker mixture allophone models achieved an error reduction of 29.0%, which is high in comparison with the conventional speaker-independent HMM (hidden Markov model)-LR method. Speaker adaptation by speaker-tied training attained an error reduction of 16.8% using a 0.6-s Japanese word utterance. Speaker pruning reduced the number of phone model mixtures by between 50% and 92% without lowering recognition performance.

机译：提出了允许创建扬声器的手机模型的扬声器混合原理。使用短于一秒钟的发言者为快速扬声器适应的讲话训练来自这一原则。还介绍了扬声器修剪的概念，用于降低计算成本而不会降低扬声器适应性能。上述原理与上下文相关的电话模型组合，这些电话模型已被连续状态分割算法自动生成。在日语短语识别实验中，扬声器混合偶联模型达到了29.0％的误差，与传统的扬声器无关的HMM（隐马尔可夫模型）-LR方法相比，这很高。演讲者适应扬声器绑定训练使用0.6-S日语话语达到16.8％的错误。扬声器修剪减少了电话模型混合物的数量在50％和92％之间而不降低识别性能。

著录项

来源
《IEEE international conference on acoustics, speech, and signal processing》|1993年||共4页
会议地点
作者
Kosaka T.; Takami J.; Institute of Electric and Electronic Engineer;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信理论;
关键词

相似文献

外文文献
中文文献
专利

1. An acoustic-phonetic-based speaker adaptation technique forimproving speaker-independent continuous speech recognition [J] . Yunxin Zhao IEEE Transactions on Speech and Audio Proceessing . 1994,第3期

机译：基于声学的说话人自适应技术，用于改善与说话人无关的连续语音识别
2. An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition [J] . Yunxin Zhao IEEE Transactions on Speech and Audio Proceeding . 1994,第3期

机译：基于声学的说话人自适应技术，用于改善与说话人无关的连续语音识别
3. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE transactions on information and systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
4. Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition [C] . Kosaka, T., Takami, . 1993

机译：使用适用于与说话者无关的语音识别的说话者混合音素模型快速调整说话者
5. Audio parsing and rapid speaker adaptation in speech recognition for spoken document retrieval. [D] . Zhou, Bowen. 2003

机译：语音识别中的音频解析和快速的说话人自适应，可用于语音文档检索。
6. Speaker-Independent Silent Speech Recognition from Flesh-Point Articulatory Movements Using an LSTM NeuralNetwork [O] . Myungjong Kim, Beiming Cao, Ted Mau, -1

机译：使用LSTM神经从肉点发音运动中独立于说话者的沉默语音识别网络
7. Cross-lingual acoustic model adaptation for speaker-independent speech recognition [O] . Karhila Reima 2010

机译：跨语言声学模型自适应，用于独立于说话人的语音识别

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅