Speaker adaptation using constrained transformation

Xintian Wu; Yonghong Yan

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Speaker adaptation using constrained transformation

【24h】

Speaker adaptation using constrained transformation

机译：使用约束变换的说话人适应

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In speech recognition research, transformation-based adaptation algorithms provide an effective way of adapting acoustic models to improve the recognition accuracy. However, when only limited amounts of adaptation data are available, the transformation is often poorly estimated, which may cause performance degradation. This paper presents the Markov Random Field Linear Regression (MRFLR) algorithm, which constrains the transformation-based adaptation by the correlations among acoustic parameters. The Markov Random Field theory is used to model the correlations. The correlations are estimated from the training corpus and hypothesized as prior knowledge of acoustic models. By explicitly incorporating them into adaptation, robust and fast adaptation can be achieved. The hypothesis is tested by comparing MRFLR with MLLR (Maximum Likelihood Linear Regression), a widely used transformation-based adaptation algorithm. Experimental results show that MRFLR outperforms MLLR when adaptation data are sparse, and converges to the MLLR performance when more adaptation data are available.

机译：在语音识别研究中，基于变换的自适应算法提供了一种有效的方法来自适应声学模型，以提高识别精度。但是，当只有有限数量的适应数据可用时，转换的估计往往很差，这可能会导致性能下降。本文提出了马尔可夫随机场线性回归（MRFLR）算法，该算法通过声学参数之间的相关性来约束基于变换的自适应。马尔可夫随机场理论用于对相关性进行建模。从训练语料库估计相关性，并假设它们是声学模型的先验知识。通过将它们明确地纳入适应中，可以实现强大而快速的适应。通过比较MRFLR和MLLR（最大似然线性回归）来检验该假设，MLLR是一种广泛使用的基于变换的自适应算法。实验结果表明，当适应数据稀疏时，MRRFR优于MLLR；当有更多适应数据时，MRRFR收敛于MLLR性能。

著录项

来源
《IEEE Transactions on Speech and Audio Proceeding》 |2004年第2期|p.168-174|共7页
作者
Xintian Wu; Yonghong Yan;
展开▼
作者单位

Intel Corp., Santa Clara, CA, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
Markov processes; regression analysis; speaker recognition; speaker adaptation; constrained transformation; acoustic models; recognition accuracy; Markov random field linear regression algorithm; acoustic parameter correlation; maximum likelihood linear regression;

机译：马尔可夫过程;回归分析;说话人识别;说话人自适应;约束变换;声学模型;识别精度;马尔可夫随机场线性回归算法;声学参数相关性;最大似然线性回归;
入库时间 2022-08-18 00:13:02

相似文献

外文文献
中文文献
专利

1. Speaker adaptation using constrained transformation [J] . Xintian Wu, Yonghong Yan IEEE Transactions on Speech and Audio Proceessing . 2004,第2期

机译：使用约束变换的说话人适应
2. Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm [J] . Yamagishi J., Kobayashi T., Nakano Y., IEEE transactions on audio, speech and language processing . 2009,第1期

机译：基于HMM的语音合成的说话人自适应算法和约束SMAPLR自适应算法的分析
3. Speaker clustering and transformation for speaker adaptation in speech recognition systems [J] . Padmanabhan M., Bahl L.R. IEEE Transactions on Speech and Audio Proceeding . 1998,第1期

机译：语音识别系统中的说话人适应和说话人聚类和转换
4. TEMPORAL STRUCTURE CONSTRAINED TRANSFORMATION FOR SPEAKER ADAPTATION [C] . IEEE IEEE International Conference on Acoustics, Speech, and Signal Processing . 2003

机译：扬声器适应的时间结构约束变换
5. Transformation sharing strategies for MLLR speaker adaptation. [D] . Mandal, Arindam. 2007

机译：MLLR说话人适应的转换共享策略。
6. Transformational adaptation when incremental adaptations to climate change are insufficient [O] . Robert W. Kates, William R. Travis, Thomas J. Wilbanks 2012

机译：当对气候变化的增量适应不足时的转型适应
7. Analysis of Speaker Adaptation Algorithms for HMM-based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm [O] . Yamagishi Junichi, Kobayashi Takao, Yuji Nakano, 2010

机译：基于HMM的语音合成的说话人自适应算法和约束SMAPLR自适应算法的分析

Speaker adaptation using constrained transformation

摘要

著录项

相似文献

相关主题

期刊订阅