首页> 外文期刊>Systems and Computers in Japan >Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking
【24h】

Dictation of Multiparty Conversation Considering Speaker Individuality and Turn Taking

机译:考虑说话者个性和转弯的多方对话听写

获取原文
获取原文并翻译 | 示例
           

摘要

This paper discusses an algorithm that recognizes multiparty speech with complex turn taking. In recognition of the conversation of multiple speakers, it is necessary to know not only what is spoken, as in the conventional system, but also who spoke up to what point. The purpose of this paper is to find a method to solve this problem. The representation of the likelihood of turn taking is included in the language model in the continuous speech recognition system, and the speech properties of each speaker are represented by a statistical model. Using this approach, two algorithms are proposed that estimate simultaneously and in parallel the speaker and the speech content. Recognition experiments using conversation in TV sports news show that the proposed method can correct a maximum of 29.5% of the errors in the recognition of speech content and 93.0% of the errors in recognition of the speaker.
机译:本文讨论了一种识别带有复杂转弯动作的多方语音的算法。在认识到多个说话者的对话时,不仅要知道在传统系统中所说的话,而且还必须知道谁说了什么话。本文的目的是找到一种解决该问题的方法。连续语音识别系统的语言模型中包含有轮到可能性的表示,并且每个说话者的语音属性都由统计模型表示。使用这种方法,提出了两种算法,可同时并行地估计说话者和语音内容。在电视体育新闻中使用会话进行的识别实验表明,该方法最多可以纠正语音内容识别中的29.5%的错误和说话者识别中的93.0%的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号