Discriminatively Trained Sparse Inverse Covariance Matrices for Low Resource Acoustic Modeling

机译：用于低资源声学建模的差异训练有素的稀疏反向协方差矩阵

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a method to discriminatively train acoustic models with sparse inverse covariance (precision) matrices in order to improve the model robustness when training data is insufficient. Acoustic models with sparse inverse covariance matrices were previously proposed to address the problem of over-fitting when training data is inadequate. Since many of the entries of the inverse covariance matrices are driven to zero, the number of free parameters to be estimated is reduced. However, previously acoustic models using sparse inverse covariance matrices were trained using maximum likelihood (ML) training. It is well-known that discriminative training can further improve the recognition accuracy. Therefore, for the first time, we study the problem of training acoustic models with sparse inverse covariance matrices using the discriminative training method. An L1 regularization term is added to the traditional objective function for discriminative training to penalize complex models and to automatically sparsify the inverse covariance matrices. The new objective function is optimized by maximizing a weak-sense auxiliary function. Experimental results on the Wall Street Journal data set show that our method effectively regularizes the model complexity and allows more Gaussian components to be trained. Therefore it can better model the non-Gaussian nature of the speech feature vectors. Compared with the standard maximum mutual information (MMI) training method, our proposed method can significantly improve the recognition accuracy.

机译：我们提出了一种方法来判别具有稀疏反应协方差（精确）矩阵的声学模型（精确）矩阵，以便在训练数据不足时改善模型稳健性。先前提出了具有稀疏协方差矩阵矩阵的声学模型，以解决培训数据不充分时的过度拟合问题。由于逆协方差矩阵的许多条目被驱动为零，因此减少了待估计的自由参数的数量。然而，使用最大似然（ML）训练训练使用使用稀疏反相矩阵矩阵的先前声学模型。众所周知，鉴别性培训可以进一步提高识别准确性。因此，首次使用鉴别训练方法研究具有稀疏反相协方差矩阵的训练声学模型的问题。 L1正则化术语被添加到传统的目标函数中，以便判别攻击复杂模型并自动缩小反向协方差矩阵。通过最大化弱道辅助功能来优化新的客观函数。墙街道日志数据集的实验结果表明，我们的方法有效地规范了模型复杂性，并允许培训更多高斯部件。因此，它可以更好地模拟语音特征向量的非高斯性质。与标准最大互信息（MMI）训练方法相比，我们所提出的方法可以显着提高识别准确性。

著录项

来源
《Conference of the International Speech Communication Association》|2013年||共5页
会议地点
作者
Weibin ZHANG; Pascale FUNG;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
regularization; components; recognition;

机译：正则化;组件;识别;

相似文献

外文文献
中文文献
专利

1. Discriminatively Trained Sparse Inverse Covariance Matrices for Speech Recognition [J] . Audio, Speech, and Language Processing, IEEE Transactions on . 2014,第5期

机译：区分训练的稀疏逆协方差矩阵用于语音识别
2. A Well-Conditioned and Sparse Estimation of Covariance and Inverse Covariance Matrices Using a Joint Penalty [J] . Ashwini Maurya Journal of machine learning research . 2016,第130期

机译：使用联合惩罚的协方差和逆协方差矩阵的条件良好且稀疏的估计
3. A Well-Conditioned and Sparse Estimation of Covariance and Inverse Covariance Matrices Using a Joint Penalty [J] . Ashwini Maurya Journal of machine learning research . 2016,第130期

机译：使用联合惩罚的协方差和逆协方差矩阵的条件良好且稀疏的估计
4. Discriminatively Trained Sparse Inverse Covariance Matrices for Low Resource Acoustic Modeling [C] . Weibin ZHANG, Pascale FUNG Conference of the International Speech Communication Association . 2013

机译：用于低资源声学建模的差异训练有素的稀疏反向协方差矩阵
5. Sparse estimation of high-dimensional covariance matrices. [D] . Rothman, Adam J. 2010

机译：高维协方差矩阵的稀疏估计。
6. Genotypic covariance matrices and their inverses for models allowing dominance and inbreeding [O] . SP Smith, A Mäki-Tanila 1990

机译：模型的基因型协方差矩阵及其逆矩阵可实现优势和近交
7. Genotypic covariance matrices and their inverses for models allowing dominance and inbreeding [O] . SP Smith, A Mäki-Tanila 2009

机译：模型的基因型协方差矩阵及其逆矩阵，可实现优势和近交

Discriminatively Trained Sparse Inverse Covariance Matrices for Low Resource Acoustic Modeling

摘要

著录项

相似文献

相关主题

期刊订阅