Analysis on MAP and MLLR based speaker adaptation techniques in speech recognition



Abstract

A speech recognition system produces text output corresponding to a given speech input. A speaker-dependent (SD) recognition system achieves higher recognition performance than a speaker-independent (SI) system. Speaker adaptation techniques such as maximum a posteriori (MAP) and maximum likelihood linear regression (MLLR) are applied to an SI system to obtain recognition performance similar to that of an SD system with a minimal amount of data. The main focus of this paper is to analyse the performance of these adaptation techniques when applied to the recognition system with different amounts of adaptation data. In this work, a speech recognition system is developed using a Tamil speech corpus. Cross-gender speaker adaptation is performed while varying the amount of adaptation data. When the adaptation data is very limited, around 30 s, the MLLR-adapted system achieves a recognition performance of 45.76%, whereas the MAP-adapted system achieves 42.44%. When the adaptation data is increased to 5 min, MAP adaptation improves the overall recognition performance by 6% over the MLLR-adapted system.
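The contrast reported in the abstract (MLLR ahead with about 30 s of data, MAP ahead with 5 min) can be illustrated with a toy sketch. This is not the paper's implementation. It assumes one-dimensional features, Gaussians with equal variance, and a hard assignment of each adaptation frame to one Gaussian. The function names `map_adapt_means` and `mllr_adapt_means` and the prior weight `tau` are hypothetical, chosen only for illustration. MAP interpolates each mean with its own data, so a Gaussian with no adaptation frames stays at its prior. The MLLR-style update fits a single shared transform `a * mu + b`, so it also moves Gaussians that were never observed.

```python
def map_adapt_means(prior_means, frames_by_gaussian, tau=2.0):
    """MAP mean update: (tau * prior + sum of frames) / (tau + frame count)."""
    adapted = []
    for mu, frames in zip(prior_means, frames_by_gaussian):
        # With no frames this reduces to the prior mean mu.
        adapted.append((tau * mu + sum(frames)) / (tau + len(frames)))
    return adapted


def mllr_adapt_means(prior_means, frames_by_gaussian):
    """Global MLLR-style update mu' = a * mu + b, fitted by least squares.

    With equal variances and hard assignments (assumptions of this sketch),
    the maximum-likelihood transform is an ordinary linear regression of
    each frame on the prior mean of the Gaussian it is assigned to.
    """
    pairs = [(mu, x)
             for mu, frames in zip(prior_means, frames_by_gaussian)
             for x in frames]
    if not pairs:
        return list(prior_means)  # no adaptation data: keep the SI means
    n = len(pairs)
    mean_mu = sum(mu for mu, _ in pairs) / n
    mean_x = sum(x for _, x in pairs) / n
    var_mu = sum((mu - mean_mu) ** 2 for mu, _ in pairs)
    if var_mu == 0.0:
        # Only one distinct mean was observed, so only a bias can be estimated.
        a, b = 1.0, mean_x - mean_mu
    else:
        a = sum((mu - mean_mu) * (x - mean_x) for mu, x in pairs) / var_mu
        b = mean_x - a * mean_mu
    return [a * mu + b for mu in prior_means]


# Toy data: the new speaker shifts every mean by +1, but the third
# Gaussian receives no adaptation frames.
prior_means = [0.0, 2.0, 4.0]
frames_by_gaussian = [[1.0, 1.0], [3.0, 3.0], []]

map_means = map_adapt_means(prior_means, frames_by_gaussian, tau=2.0)
mllr_means = mllr_adapt_means(prior_means, frames_by_gaussian)
print(map_means)   # [0.5, 2.5, 4.0]
print(mllr_means)  # [1.0, 3.0, 5.0]
```

In this toy case the unseen third mean stays at 4.0 under MAP but moves to 5.0 under the shared transform, which matches the intuition for sparse data. As the number of frames per Gaussian grows, the MAP estimate approaches each Gaussian's own data mean, while the single global transform remains limited to one linear map for all Gaussians.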


Bibliographic details

Source: International Conference on Circuit, Power and Computing Technologies, 2014, pp. 1753-1758 (6 pages)
Authors: Ramya T.; Christina S. Lilly; Vijayalakshmi P.; Nagarajan T.
Original format: PDF
Keywords: MAP; MLLR; Speaker adaptation
