IEEE Transactions on Speech and Audio Processing

Maximum likelihood and minimum classification error factor analysis for automatic speech recognition



Abstract

Hidden Markov models (HMMs) for automatic speech recognition rely on high dimensional feature vectors to summarize the short-time properties of speech. Correlations between features can arise when the speech signal is nonstationary or corrupted by noise. We investigate how to model these correlations using factor analysis, a statistical method for dimensionality reduction. Factor analysis uses a small number of parameters to model the covariance structure of high dimensional data. These parameters can be chosen in two ways: (1) to maximize the likelihood of observed speech signals, or (2) to minimize the number of classification errors. We derive an expectation-maximization (EM) algorithm for maximum likelihood estimation and a gradient descent algorithm for improved class discrimination. Speech recognizers are evaluated on two tasks, one small-sized vocabulary (connected alpha-digits) and one medium-sized vocabulary (New Jersey town names). We find that modeling feature correlations by factor analysis leads to significantly increased likelihoods and word accuracies. Moreover, the rate of improvement with model size often exceeds that observed in conventional HMM's.
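For reference, factor analysis models the covariance of a d-dimensional feature vector as Lambda Lambda^T + Psi, where Lambda is a d-by-f loading matrix with f much smaller than d and Psi is diagonal, so roughly d(f+1) parameters replace the d(d+1)/2 of a full covariance matrix. The sketch below is a minimal numpy implementation of the EM updates for a single maximum-likelihood factor analyzer; it is illustrative only and does not reproduce the paper's per-state mixture training or its discriminative (minimum classification error) gradient updates. The function name, initialization, and iteration count are assumptions, not details from the paper.

    import numpy as np

    def factor_analysis_em(X, num_factors, num_iters=50, eps=1e-6):
        """Maximum-likelihood factor analysis via EM (illustrative sketch).

        Models the covariance of d-dimensional data as Lambda Lambda^T + Psi,
        with Lambda a d x f loading matrix and Psi a diagonal noise matrix.
        """
        N, d = X.shape
        mu = X.mean(axis=0)
        Xc = X - mu                                  # centered data
        S = Xc.T @ Xc / N                            # sample covariance

        rng = np.random.default_rng(0)
        Lam = rng.standard_normal((d, num_factors)) * 0.01
        Psi = np.diag(S).copy() + eps                # diagonal noise variances

        for _ in range(num_iters):
            # E-step: posterior of the latent factors given each observation.
            PsiInvLam = Lam / Psi[:, None]           # Psi^{-1} Lambda
            G = np.linalg.inv(np.eye(num_factors) + Lam.T @ PsiInvLam)
            Ez = Xc @ PsiInvLam @ G                  # N x f posterior means
            Ezz = N * G + Ez.T @ Ez                  # sum_n E[z_n z_n^T | x_n]

            # M-step: re-estimate loadings and diagonal noise.
            Lam = (Xc.T @ Ez) @ np.linalg.inv(Ezz)
            Psi = np.diag(S - Lam @ (Ez.T @ Xc) / N) + eps

        return mu, Lam, Psi

After training, the modeled covariance is Lam @ Lam.T + np.diag(Psi); increasing num_factors trades parameters for a closer fit, which is the model-size axis the abstract refers to.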

