首页> 外文会议>Human language technology >A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments

【24h】

A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments

机译：可变声学环境下大词汇量连续语音识别的神经网络系统

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Performance of speech recognizers is typically degraded by deleterious properties of the acoustic environment, such as multipath distortion (reverberation) and ambient noise. The degradation becomes more prominent as the microphone is positioned more distant from the speaker, for instance, in a teleconferencing application. Mismatched training and testing conditions, such as frequency response, microphone, signal-to-noise ratio (SNR), and room reverberation, also degrade recognition performance. Among available approaches to handling mismatches between training and testing conditions, a popular one is to retrain the speech recognizer under new environments. Hidden Markov models (HMM) have to date been accepted as an effective classification method for large vocabulary continuous speech recognition, e.g., the ARPA-sponsored SPHINX and DECIPHER. Retraining of HMM-based recognizers is a complex and tedious task. It requires recollection of speech data under corresponding conditions and reestimation of HMM's parameters. Particularly great time and effort are needed to retrain a recognizer which operates in a speaker-independent mode, which is the mode of greatest general interest.

机译：语音识别器的性能通常会因声学环境的有害特性而降低，例如多径失真（混响）和环境噪声。例如，在电话会议应用中，随着麦克风与扬声器的距离越来越远，降级变得更加明显。训练和测试条件不匹配，例如频率响应，麦克风，信噪比（SNR）和房间混响，也会降低识别性能。在处理训练条件与测试条件之间不匹配的可用方法中，一种流行的方法是在新环境下对语音识别器进行再训练。迄今为止，隐马尔可夫模型（HMM）已被接受为大词汇量连续语音识别的有效分类方法，例如，由ARPA支持的SPHINX和DECIPHER。基于HMM的识别器的再培训是一项复杂而乏味的任务。它要求在相应条件下重新收集语音数据，并重新估算HMM参数。重新训练以独立于说话者的模式工作的识别器需要特别大的时间和精力，这是最大的兴趣所在。

著录项

来源
《Human language technology》|1994年|470-470|共1页
会议地点 Plainsboro NJ(US)
作者
J. Flanagan; Q. Lin; J. Pearson; B. de Vries;
展开▼
作者单位

CAIP Center, Rutgers University;

CAIP Center, Rutgers University;

David Sarnoff Research Center;

David Sarnoff Research Center;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Acoustic Models of the Elderly for Large-Vocabulary Continuous Speech Recognition [J] . Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Electronics and Communications in Japan. Part 2, Electronics . 2004,第7期

机译：大词汇量连续语音识别的老年人声学模型
2. Large-Vocabulary Continuous Speech Recognition Systems: A Look at Some Recent Advances [J] . Saon G., Chien J.-T. Signal Processing Magazine, IEEE . 2012,第6期

机译：大词汇量连续语音识别系统：最近的一些进展
3. A VLSI grammar processing subsystem for a real-time large-vocabulary continuous speech recognition system [J] . Chen D.C., Yu R. IEEE Journal of Solid-State Circuits . 1991,第3期

机译：实时大词汇量连续语音识别系统的VLSI语法处理子系统
4. A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments [C] . Human language technology workshop . 1994

机译：一种用于可变声环境中大词汇连续语音识别的神经网络系统
5. Large-vocabulary speaker-independent continuous speech recognition: The SPHINX system. [D] . Lee, Kai-Fu. 1988

机译：独立于大词汇的说话者的连续语音识别：SPHINX系统。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Syllable-Length Acoustic Units in Large-Vocabulary Continuous Speech Recognition [O] . Hämäläinen K.A., Boves L.W.J., Veth J.M. de 2005

机译：大词汇量连续语音识别中的音节长度声学单位
8. Improving State-of-the-Art Continuous Speech Recognition System Using the N-Best Paradigm with Neural Networks. [R] . Austin, S., Zavaliagkost, G., Makhoul, J., 1992

机译：利用神经网络的N-Best范式改进最先进的连续语音识别系统。

A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments

摘要

著录项

相似文献

相关主题

期刊订阅