首页> 外文会议>International Conference on Circuit, Power and Computing Technologies >Analysis on MAP and MLLR based speaker adaptation techniques in speech recognition
【24h】

Analysis on MAP and MLLR based speaker adaptation techniques in speech recognition

机译:语音识别中基于MAP和MLLR的说话人自适应技术分析

获取原文

摘要

Speech recognition system produces a text output corresponding to the given speech input. A speaker-dependent (SD) recognition system results in a higher recognition performance when compared to a speaker-independent (SI) system. Speaker adaptation techniques like maximum aposteriori (MAP) and maximum likelihood linear regression (MLLR) are applied to an SI system, in order to get a recognition performance similar to that of an SD system, with minimal amount of data. The main focus of this paper is to analyse the performance of the adaptation techniques, applied to the recognition system for different amount of adaptation data. In this work, a speech recognition system is developed using Tamil speech corpus. Cross-gender speaker adaptation is performed by varying the adaptation data. It is observed that when the adaptation data is very minimum, around 30s, the recognition performance of MLLR adapted system results in 45.76% when MAP adapted system resulted in 42.44%. When the adaptation data is increased to 5min, the overall recognition performance is improved by 6% for MAP adaptation over MLLR adapted recognition system.
机译:语音识别系统产生与给定语音输入相对应的文本输出。与说话者无关(SI)系统相比,说话者依赖性(SD)识别系统具有更高的识别性能。说话人自适应技术(如最大撇号(MAP)和最大似然线性回归(MLLR))应用于SI系统,以便以最少的数据量获得与SD系统相似的识别性能。本文的主要重点是分析适应技术的性能,将其应用于不同数量的适应数据的识别系统。在这项工作中,使用泰米尔语语料库开发了语音识别系统。通过改变适应数据来执行跨性别说话者适应。可以看出,当自适应数据非常小,大约30s时,MLLR自适应系统的识别性能为45.76%,而MAP自适应系统的识别性能为42.44%。当自适应数据增加到5分钟时,与MLLR自适应识别系统相比,MAP自适应的总体识别性能提高了6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号