...
首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Speaker indexing based on speaker model selection and automatic speech recognition in discussions
【24h】

Speaker indexing based on speaker model selection and automatic speech recognition in discussions

机译:基于扬声器模型选择和讨论中的自动语音识别的扬声器索引

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses unsupervised speaker indexing for discussion audio archives. In discussions, the speaker changes frequently, thus the duration of utterances is very short and its variation is large, which causes significant problems in applying conventional methods such as model adaptation and Variance-BIC (Bayesian Information Criterion) methods. We propose a flexible framework that selects an optimal speaker model (GMM or VQ) based on the BIC according to the duration of utterances. When the speech segment is short, the simple and robust VQ-based method is expected to be chosen, while GMM will be reliably trained for long segments. For a discussion archive, it is demonstrated that the proposed method achieves higher indexing performance than that of conventional methods. The speaker index is useful for speaker adaptation of the acoustic model, which improves
机译:本文涉及讨论音频档案的无监督者索引。 在讨论中,扬声器经常变化,因此话语的持续时间非常短,其变化很大,这在应用诸如模型适应和方差 - BIC(贝叶斯信息标准)方法之类的传统方法中引起重大问题。 我们提出了一种灵活的框架,根据话语的持续时间基于BIC选择最佳扬声器模型(GMM或VQ)。 当语音段短时,预计基于简单且坚固的VQ的方法将被选中,而GMM将可靠地培训用于长段。 对于讨论归档,证明该方法的索引性能比传统方法更高。 扬声器指数对于声学模型的扬声器调整有用,这改善了

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号