首页> 外文会议>Human language technology >A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments
【24h】

A Neural Network System for Large-Vocabulary Continuous Speech Recognition in Variable Acoustic Environments

机译:可变声学环境下大词汇量连续语音识别的神经网络系统

获取原文
获取原文并翻译 | 示例

摘要

Performance of speech recognizers is typically degraded by deleterious properties of the acoustic environment, such as multipath distortion (reverberation) and ambient noise. The degradation becomes more prominent as the microphone is positioned more distant from the speaker, for instance, in a teleconferencing application. Mismatched training and testing conditions, such as frequency response, microphone, signal-to-noise ratio (SNR), and room reverberation, also degrade recognition performance. Among available approaches to handling mismatches between training and testing conditions, a popular one is to retrain the speech recognizer under new environments. Hidden Markov models (HMM) have to date been accepted as an effective classification method for large vocabulary continuous speech recognition, e.g., the ARPA-sponsored SPHINX and DECIPHER. Retraining of HMM-based recognizers is a complex and tedious task. It requires recollection of speech data under corresponding conditions and reestimation of HMM's parameters. Particularly great time and effort are needed to retrain a recognizer which operates in a speaker-independent mode, which is the mode of greatest general interest.
机译:语音识别器的性能通常会因声学环境的有害特性而降低,例如多径失真(混响)和环境噪声。例如,在电话会议应用中,随着麦克风与扬声器的距离越来越远,降级变得更加明显。训练和测试条件不匹配,例如频率响应,麦克风,信噪比(SNR)和房间混响,也会降低识别性能。在处理训练条件与测试条件之间不匹配的可用方法中,一种流行的方法是在新环境下对语音识别器进行再训练。迄今为止,隐马尔可夫模型(HMM)已被接受为大词汇量连续语音识别的有效分类方法,例如,由ARPA支持的SPHINX和DECIPHER。基于HMM的识别器的再培训是一项复杂而乏味的任务。它要求在相应条件下重新收集语音数据,并重新估算HMM参数。重新训练以独立于说话者的模式工作的识别器需要特别大的时间和精力,这是最大的兴趣所在。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号