Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model

Noriyuki Murai; Tetsunori Kobayashi

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model

【24h】

Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model

机译：使用MLLR说话者自适应和统计转向模型对多方对话进行听写

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A new speech decoder dealing with multiparty conversation is proposed. Multiparty conversation denotes a situation in which many speakers talk each other. In such a situation, the system has to recognize not only the word sequence of the input speech but also the speaker of each part of them. We propose the method utilizing not only acoustic model and language model, which are the resources of conventional single-user speech decoder, but also stochastic turn taking model and speakers individual models using MLLR speaker adaptation to recognize speech. This framework realizes simultaneous maximum likelihood estimation of spoken word sequence and the speaker sequence. Experimental results using TV sports news show that the proposed method reduce the word error rate by 29.5 % and speaker error rate by 89.7 % compared to the conventional method.

机译：提出了一种新的处理多方对话的语音解码器。多方对话表示许多发言人互相交谈的情况。在这种情况下，系统不仅必须识别输入语音的单词序列，而且还必须识别它们每个部分的说话者。我们提出的方法不仅利用声学模型和语言模型，这是传统的单用户语音解码器的资源，而且还利用MLLR说话者自适应来识别语音的随机转向模型和说话者个人模型。该框架实现了语音单词序列和说话者序列的同时最大似然估计。电视体育新闻的实验结果表明，与传统方法相比，该方法可将单词错误率降低29.5％，将说话者错误率降低89.7％。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2000年第136期|共8页
作者
Noriyuki Murai; Tetsunori Kobayashi;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类电报、传真;
关键词
Multiparty conversation; Stochastic turn taking model; Speaker individuality; GMM; MLLR;

机译：多方对话;随机转弯模型;说话者个性;GMM;MLLR;

相似文献

外文文献
中文文献
专利

1. Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model [J] . Noriyuki Murai, Tetsunori Kobayashi 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：使用MLLR说话者自适应和统计转向模型对多方对话进行听写
2. Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking [J] . Noriyuki Murai, Tetsunori Kobayashi Systems and Computers in Japan . 2003,第13期

机译：考虑说话者个性和转弯的多方对话听写
3. Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models [J] . Randy GOMEZ, Akinobu LEE, Tomoki TODA, IEICE Transactions on Information and Systems . 2006,第3期

机译：使用多模板模型在嘈杂环境中提高基于HMM足够统计量的快速无监督说话人适应
4. Dictation of multiparty conversation using statistical turn taking model and speaker model [C] . Murai, N., Kobayashi, . 2000

机译：使用统计转向模型和说话者模型对多方对话进行听写
5. Transformation sharing strategies for MLLR speaker adaptation. [D] . Mandal, Arindam. 2007

机译：MLLR说话人适应的转换共享策略。
6. Bilingual parents’ modeling of pragmatic language use in multiparty interactions [O] . Medha Tare, Susan A. Gelman -1

机译：双语父母的多方互动务实的语言使用建模
7. RAPID FEATURE SPACE MLLR SPEAKER ADAPTATION WITH BILINEAR MODELS [O] . Shilei Zhang, Peder A. Olsen, Yong Qin 2016

机译：快速特征空间mLLR扬声器适应双线性模型

Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model

摘要

著录项

相似文献

相关主题

期刊订阅