Speaker indexing based on speaker model selection and automatic speech recognition in discussions

Masafumi Nishida; Yuya Akita; Tatsuya Kawahara

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Speaker indexing based on speaker model selection and automatic speech recognition in discussions

【24h】

Speaker indexing based on speaker model selection and automatic speech recognition in discussions

机译：Speaker indexing based on speaker model selection and automatic speech recognition in discussions

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

This paper addresses unsupervised speaker indexing for discussion audio archives. In discussions, the speaker changes frequently, thus the duration of utterances is very short and its variation is large, which causes significant problems in applying conventional methods such as model adaptation and Variance-BIC (Bayesian Information Criterion) methods. We propose a flexible framework that selects an optimal speaker model (GMM or VQ) based on the BIC according to the duration of utterances. When the speech segment is short, the simple and robust VQ-based method is expected to be chosen, while GMM will be reliably trained for long segments. For a discussion archive, it is demonstrated that the proposed method achieves higher indexing performance than that of conventional methods. The speaker index is useful for speaker adaptation of the acoustic model, which improves

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2002年第530期|55-60|共6页
作者
Masafumi Nishida; Yuya Akita; Tatsuya Kawahara;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种日语
中图分类电报、传真;
关键词
Speech recognition; Speaker recognition; Discussions; unsupervised speaker indexing; Model selection; Bayesian information criterion;
入库时间 2024-01-25 00:42:56

Speaker indexing based on speaker model selection and automatic speech recognition in discussions

摘要

著录项

相关主题

期刊订阅