首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Learning improved linear transforms for speech recognition
【24h】

Learning improved linear transforms for speech recognition

机译:学习用于语音识别的改进线性变换

获取原文

摘要

This paper explores a novel large margin approach to learning a linear transform for dimensionality reduction in speech recognition. The method assumes a trained Gaussian mixture model for each class to be discriminated and trains a dimensionality-reducing linear transform with respect to the fixed model, optimizing a hinge loss on the difference between the distance to the nearest in- and out-of-class Gaussians using stochastic gradient descent. Results are presented showing that the learnt transform improves state classification for individual frames and reduces word error rate compared to Linear Discriminant Analysis (LDA) in a large vocabulary speech recognition problem even after discriminative training.
机译:本文探索了一种新颖的大余量方法,用于学习用于语音识别中降维的线性变换。该方法为要区分的每个类别假设一个经过训练的高斯混合模型,并针对固定模型训练一个降维线性变换,从而根据距最接近的类别内和类别外的距离之间的差异优化了铰链损耗高斯人使用随机梯度下降。结果表明,相对于线性判别分析(LDA),即使经过判别训练后,在较大的词汇语音识别问题中,学习的变换也可以改善单个帧的状态分类并降低单词错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号