首页> 外文会议>IEEE workshop on neural networks for signal processing >A monolithic speech recognizer based on fully recurrent neural networks
【24h】

A monolithic speech recognizer based on fully recurrent neural networks

机译:基于完全复发性神经网络的单片语音识别器

获取原文

摘要

Reports on investigations concerning the application of fully recurrent neural networks (FRNN) for speaker independent speech recognition. In a phoneme based recognition system separate FRNN are used for feature scoring as well as for compensating variations in time durations of speech segments. A recognizer with a FRNN for feature scoring achieves the same recognition rate as a recognition system where the context information is provided. The performance of the FRNN used for time alignment is comparable to that of a viterbi based alignment with durational constraints. Additionally, a monolithic speech recognizer is realized by FRNN which directly classifies feature sequences. The performance of this FRNN is comparable to that of speech recognition systems which are based on discrete hidden Markov models and use a sophisticated durational modeling. Furthermore, simulation experiments revealed that FRNN are able to extract relevant information for speech recognition from noise contaminated speech and thus achieve a robust recognition performance.
机译:关于扬声器独立语音识别全反动性神经网络(FRNN)应用的报告。在基于音素的识别系统中,单独的FRNN用于特征评分以及用于补偿语音段的时间持续时间的变化。具有FRNN的特征评分的识别器实现了与提供上下文信息的识别系统相同的识别率。用于时间对齐的FRNN的性能与与持久性约束的基于维特比对的比对的性能相当。另外,通过FRNN实现单片语音识别器,其直接对特征序列进行分类。该FRNN的性能与基于离散隐马尔可夫模型的语音识别系统的性能相当,并且使用复杂的持久性建模。此外,模拟实验表明,FRNN能够从噪声污染语音中提取语音识别的相关信息,从而实现鲁棒识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号