...
首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Speaker indexing based on speaker model selection and automatic speech recognition in discussions
【24h】

Speaker indexing based on speaker model selection and automatic speech recognition in discussions

机译:讨论中基于说话人模型选择和自动语音识别的说话人索引

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses unsupervised speaker indexing for discussion audio archives. In discussions, the speaker changes frequently, thus the duration of utterances is very short and its variation is large, which causes significant problems in applying conventional methods such as model adaptation and Variance-BIC (Bayesian Information Criterion) methods. We propose a flexible framework that selects an optimal speaker model (GMM or VQ) based on the BIC according to the duration of utterances. When the speech segment is short, the simple and robust VQ-based method is expected to be chosen, while GMM will be reliably trained for long segments. For a discussion archive, it is demonstrated that the proposed method achieves higher indexing performance than that of conventional methods. The speaker index is useful for speaker adaptation of the acoustic model, which improves
机译:本文讨论了讨论音频档案的无监督发言人索引。在讨论中,说话者经常变化,因此说话的持续时间非常短并且其变化很大,这在应用诸如模型自适应和Variance-BIC(贝叶斯信息准则)方法之类的常规方法时引起了重大问题。我们提出了一个灵活的框架,该框架根据发声的持续时间,根据BIC选择最佳的讲话者模型(GMM或VQ)。当语音段很短时,预计将选择基于VQ的简单而强大的方法,而GMM将针对长段可靠地进行训练。对于讨论档案,证明了所提出的方法比常规方法具有更高的索引性能。说话者索引对于声学模型的说话者适应很有用,它可以改善

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号