首页> 外文期刊>IEICE transactions on information and systems >Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
【24h】

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

机译:使用演讲者级模型的演讲者语音识别的无监督演讲者自适应

获取原文
       

摘要

In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (CSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.
机译:在本文中,我们为LVCSR系统提出了一种新的说话人类别建模及其适应方法,并在自发日语语料库(CSJ)上对该方法进行了评估。在这种方法中,从训练说话者中选择更近的说话者,并通过对每个评估说话者使用其发声来训练声学模型。扬声器类模型的主要问题之一是确定扬声器的选择范围。为了解决该问题,预先为每个评估说话者准备了具有不同说话者范围的几个模型,并且在识别步骤中基于似然选择最合适的模型。此外,我们通过对说话人类别的模型进行无人监督的说话人自适应来提高识别性能。在识别实验中,与传统的自适应方法相比,通过使用基于说话者类别模型的拟议的说话者自适应,可以获得明显的改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号