首页> 外文会议>Conference of the International Speech Communication Association >Discriminatively Trained Sparse Inverse Covariance Matrices for Low Resource Acoustic Modeling
【24h】

Discriminatively Trained Sparse Inverse Covariance Matrices for Low Resource Acoustic Modeling

机译:用于低资源声学建模的差异训练有素的稀疏反向协方差矩阵

获取原文

摘要

We propose a method to discriminatively train acoustic models with sparse inverse covariance (precision) matrices in order to improve the model robustness when training data is insufficient. Acoustic models with sparse inverse covariance matrices were previously proposed to address the problem of over-fitting when training data is inadequate. Since many of the entries of the inverse covariance matrices are driven to zero, the number of free parameters to be estimated is reduced. However, previously acoustic models using sparse inverse covariance matrices were trained using maximum likelihood (ML) training. It is well-known that discriminative training can further improve the recognition accuracy. Therefore, for the first time, we study the problem of training acoustic models with sparse inverse covariance matrices using the discriminative training method. An L1 regularization term is added to the traditional objective function for discriminative training to penalize complex models and to automatically sparsify the inverse covariance matrices. The new objective function is optimized by maximizing a weak-sense auxiliary function. Experimental results on the Wall Street Journal data set show that our method effectively regularizes the model complexity and allows more Gaussian components to be trained. Therefore it can better model the non-Gaussian nature of the speech feature vectors. Compared with the standard maximum mutual information (MMI) training method, our proposed method can significantly improve the recognition accuracy.
机译:我们提出了一种方法来判别具有稀疏反应协方差(精确)矩阵的声学模型(精确)矩阵,以便在训练数据不足时改善模型稳健性。先前提出了具有稀疏协方差矩阵矩阵的声学模型,以解决培训数据不充分时的过度拟合问题。由于逆协方差矩阵的许多条目被驱动为零,因此减少了待估计的自由参数的数量。然而,使用最大似然(ML)训练训练使用使用稀疏反相矩阵矩阵的先前声学模型。众所周知,鉴别性培训可以进一步提高识别准确性。因此,首次使用鉴别训练方法研究具有稀疏反相协方差矩阵的训练声学模型的问题。 L1正则化术语被添加到传统的目标函数中,以便判别攻击复杂模型并自动缩小反向协方差矩阵。通过最大化弱道辅助功能来优化新的客观函数。墙街道日志数据集的实验结果表明,我们的方法有效地规范了模型复杂性,并允许培训更多高斯部件。因此,它可以更好地模拟语音特征向量的非高斯性质。与标准最大互信息(MMI)训练方法相比,我们所提出的方法可以显着提高识别准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号