This paper explores a novel large-margin approach to learning a linear transform for dimensionality reduction in speech recognition. The method assumes a trained Gaussian mixture model for each class to be discriminated and, keeping that model fixed, trains a dimensionality-reducing linear transform by stochastic gradient descent, optimizing a hinge loss on the difference between the distances to the nearest in-class and out-of-class Gaussians. Results show that the learnt transform improves state classification for individual frames and, in a large-vocabulary speech recognition task, reduces word error rate relative to Linear Discriminant Analysis (LDA), even after discriminative training.
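The core objective can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes identity covariances (so the Gaussian distance reduces to a squared Euclidean distance in the projected space), a fixed margin of 1, and that the nearest in-class and out-of-class Gaussian means have already been found; the function and variable names are hypothetical.

```python
import numpy as np

def hinge_loss_and_grad(A, x, mu_in, mu_out, margin=1.0):
    """Hinge loss on the distance difference to the nearest in-class and
    out-of-class Gaussians, with gradient w.r.t. the transform A.

    Simplifying assumption: identity covariances, so the distance under a
    Gaussian with mean mu is d = ||A x - mu||^2 in the projected space.
    """
    z = A @ x                                 # project the frame
    d_in = np.sum((z - mu_in) ** 2)           # distance to nearest in-class Gaussian
    d_out = np.sum((z - mu_out) ** 2)         # distance to nearest out-of-class Gaussian
    loss = max(0.0, margin + d_in - d_out)    # hinge on the distance difference
    if loss == 0.0:                           # margin satisfied: zero gradient
        return 0.0, np.zeros_like(A)
    # d/dA of ||A x - mu||^2 is 2 (A x - mu) x^T
    grad = 2.0 * np.outer(z - mu_in, x) - 2.0 * np.outer(z - mu_out, x)
    return loss, grad

# One stochastic gradient descent step on a toy 4-dim frame -> 2-dim projection
rng = np.random.default_rng(0)
A = 0.1 * rng.standard_normal((2, 4))
x = rng.standard_normal(4)
mu_in, mu_out = np.zeros(2), np.ones(2)
loss, grad = hinge_loss_and_grad(A, x, mu_in, mu_out)
A -= 0.01 * grad                              # update transform; the GMM stays fixed
```

In the full method, this per-frame update would be applied over the training corpus, with the Gaussian model held fixed throughout.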