【24h】

A study on speaker normalized MLP features in LVCSR

机译:LVCSR中的说话人归一化MLP功能研究

获取原文

摘要

Different normalization methods are applied in recent Large Vocabulary Continuous Speech Recognition Systems (LVCSR) to reduce the influence of speaker variability on the acoustic models. In this paper we investigate the use of Vocal Tract Length Normalization (VTLN) and Speaker Adaptive Training (SAT) in Multi Layer Perceptron (MLP) feature extraction on an English task. We achieve significant improvements by each normalization method and we gain further by stacking the normalizations. Studying features transformed by Constrained Maximum Likelihood Linear Regression (CMLLR) based SAT as possible input for MLP, further experiments show that MLP could not consistently take advantage of SAT as it does in case of VTLN.
机译:在最近的大词汇量连续语音识别系统(LVCSR)中应用了不同的归一化方法,以减少说话者变异性对声学模型的影响。在本文中,我们研究了在英语任务的多层感知器(MLP)特征提取中使用人行道长度归一化(VTLN)和说话人自适应训练(SAT)。我们通过每种归一化方法都实现了显着的改进,并且通过堆叠归一化而进一步受益。研究基于约束最大似然线性回归(CMLLR)的SAT转换的特征作为MLP的可能输入,进一步的实验表明MLP不能像VTLN那样始终如一地利用SAT。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号