首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Speaker adaptation using constrained transformation
【24h】

Speaker adaptation using constrained transformation

机译:使用约束变换的说话人适应

获取原文
获取原文并翻译 | 示例
       

摘要

In speech recognition research, transformation-based adaptation algorithms provide an effective way of adapting acoustic models to improve the recognition accuracy. However, when only limited amounts of adaptation data are available, the transformation is often poorly estimated, which may cause performance degradation. This paper presents the Markov Random Field Linear Regression (MRFLR) algorithm, which constrains the transformation-based adaptation by the correlations among acoustic parameters. The Markov Random Field theory is used to model the correlations. The correlations are estimated from the training corpus and hypothesized as prior knowledge of acoustic models. By explicitly incorporating them into adaptation, robust and fast adaptation can be achieved. The hypothesis is tested by comparing MRFLR with MLLR (Maximum Likelihood Linear Regression), a widely used transformation-based adaptation algorithm. Experimental results show that MRFLR outperforms MLLR when adaptation data are sparse, and converges to the MLLR performance when more adaptation data are available.
机译:在语音识别研究中,基于变换的自适应算法提供了一种有效的方法来自适应声学模型,以提高识别精度。但是,当只有有限数量的适应数据可用时,转换的估计往往很差,这可能会导致性能下降。本文提出了马尔可夫随机场线性回归(MRFLR)算法,该算法通过声学参数之间的相关性来约束基于变换的自适应。马尔可夫随机场理论用于对相关性进行建模。从训练语料库估计相关性,并假设它们是声学模型的先验知识。通过将它们明确地纳入适应中,可以实现强大而快速的适应。通过比较MRFLR和MLLR(最大似然线性回归)来检验该假设,MLLR是一种广泛使用的基于变换的自适应算法。实验结果表明,当适应数据稀疏时,MRRFR优于MLLR;当有更多适应数据时,MRRFR收敛于MLLR性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号