Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking

Noriyuki Murai; Tetsunori Kobayashi

首页> 外文期刊>Systems and Computers in Japan >Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking

【24h】

Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking

机译：考虑说话者个性和转弯的多方对话听写

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper discusses an algorithm that recognizes multiparty speech with complex turn taking. In recognition of the conversation of multiple speakers, it is necessary to know not only what is spoken, as in the conventional system, but also who spoke up to what point. The purpose of this paper is to find a method to solve this problem. The representation of the likelihood of turn taking is included in the language model in the continuous speech recognition system, and the speech properties of each speaker are represented by a statistical model. Using this approach, two algorithms are proposed that estimate simultaneously and in parallel the speaker and the speech content. Recognition experiments using conversation in TV sports news show that the proposed method can correct a maximum of 29.5% of the errors in the recognition of speech content and 93.0% of the errors in recognition of the speaker.

机译：本文讨论了一种识别带有复杂转弯动作的多方语音的算法。在认识到多个说话者的对话时，不仅要知道在传统系统中所说的话，而且还必须知道谁说了什么话。本文的目的是找到一种解决该问题的方法。连续语音识别系统的语言模型中包含有轮到可能性的表示，并且每个说话者的语音属性都由统计模型表示。使用这种方法，提出了两种算法，可同时并行地估计说话者和语音内容。在电视体育新闻中使用会话进行的识别实验表明，该方法最多可以纠正语音内容识别中的29.5％的错误和说话者识别中的93.0％的错误。

著录项

来源
《Systems and Computers in Japan》 |2003年第13期|共9页
作者
Noriyuki Murai; Tetsunori Kobayashi;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Multiparty conversation; Statistical turn taking model; Speaker individuality; GMM; MLLR;

机译：多方对话;统计转向模型;说话人个性;GMM;MLLR;

相似文献

外文文献
中文文献
专利

1. Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking [J] . Noriyuki Murai, Tetsunori Kobayashi Systems and Computers in Japan . 2003,第13期

机译：考虑说话者个性和转弯的多方对话听写
2. Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model [J] . Noriyuki Murai, Tetsunori Kobayashi 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：使用MLLR说话者自适应和统计转向模型对多方对话进行听写
3. Dictation of multiparty conversation using MLLR speaker adaptation and statistical turn taking model [J] . Noriyuki Murai, Tetsunori Kobayashi 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：使用MLLR扬声器适应和统计转向模型的多党对话的听写
4. Dictation of multiparty conversation using statistical turn taking model and speaker model [C] . Murai, N., Kobayashi, . 2000

机译：使用统计转向模型和说话者模型对多方对话进行听写
5. Gaze, turn-taking and proxemics in multiparty versus dyadic conversation across cultures. [D] . Herrera, David Alberto. 2010

机译：跨文化的多方对话与二元对话中的注视，转弯和近距离。
6. How Speakers Orient to the Notable Absence of Talk: A Conversation Analytic Perspective on Silence in Psychodynamic Therapy [O] . A. S. L. Knol, Tom Koole, Mattias Desmet, 2020

机译：扬声器如何定向到谈话中的显着缺席：一种谈话分析视角沉默在心理学治疗中
7. Communication Strategies of Non-Native Speaker to Native-Speaker conversation in an English conversation. [O] . PRANES SETYAWAN ADI 2013

机译：在英语对话中，非母语人士与母语人士之间的交流策略。

Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking

摘要

著录项

相似文献

相关主题

期刊订阅