首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Factor analysis based VTS discriminative adaptive training
【24h】

Factor analysis based VTS discriminative adaptive training

机译:基于因子分析的VTS判别自适应训练

获取原文

摘要

Vector Taylor Series (VTS) model based compensation is a powerful approach for noise robust speech recognition. An important extension to this approach is VTS adaptive training (VAT), which allows canonical models to be estimated on diverse noise-degraded training data. These canonical model can be estimated using EM-based approaches, allowing simple extensions to discriminative VAT (DVAT). However to ensure a diagonal corrupted speech covariance matrix the Jacobian (loading matrix) relating the noise and clean speech is diagonalised. In this work an approach for yielding optimal diagonal loading matrices based on minimising the expected KL-divergence between the diagonal loading matrix and “correct” distributions is proposed. The performance of DVAT using the standard and optimal diagonalisation was evaluated on both in-car collected data and the Aurora4 task.
机译:基于矢量泰勒级数(VTS)模型的补偿是强大的噪声鲁棒语音识别方法。此方法的重要扩展是VTS自适应训练(VAT),它允许根据各种降噪后的训练数据来估计经典模型。可以使用基于EM的方法来估计这些规范模型,从而可以简单地扩展到区分性增值税(DVAT)。但是,为了确保对角线损坏的语音协方差矩阵,将与噪声和干净语音相关的雅可比行列式(负载矩阵)对角化。在这项工作中,提出了一种基于最小化对角线加载矩阵和“正确”分布之间的预期KL-散度的方法来产生最佳对角线加载矩阵的方法。使用标准和最佳对角线化技术对DVAT的性能进行了评估,包括车内收集的数据和Aurora4任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号