首页> 外文期刊>Speech Communication >Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus
【24h】

Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus

机译:波士顿大学广播语音语料库中单词和韵律的同时识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes automatic speech recognition systems that satisfy two technological objectives. First, we seek to improve the automatic labeling of prosody, in order to aid future research in automatic speech understanding. Second, we seek to apply statistical speech recognition models of prosody for the purpose of reducing the word error rate of an automatic speech recognizer. The systems described in this paper are variants of a core dynamic Bayesian network model, in which the key hidden variables are the word, the prosodic tag sequence, and the prosody-dependent allo-phones. Statistical models of the interaction among words and prosodic tags are trained using the Boston University Radio Speech Corpus, a database annotated using the tones and break indices (ToBI) prosodic annotation system. This paper presents both theoretical and empirical results in support of the conclusion that a prosody-dependent speech recognizer—a recognizer that simultaneously computes the most-probable word labels and prosodic tags—can provide lower word recognition error rates than a standard prosody-independent speech recognizer in a multi-speaker speaker-dependent speech recognition task on radio speech.
机译:本文介绍了满足两个技术目标的自动语音识别系统。首先,我们寻求改进韵律的自动标记,以帮助将来对自动语音理解的研究。第二,我们寻求应用韵律的统计语音识别模型,以降低自动语音识别器的单词错误率。本文介绍的系统是核心动态贝叶斯网络模型的变体,其中关键的隐藏变量是单词,韵律标记序列和与韵律相关的异音素。使用波士顿大学广播语音语料库训练单词和韵律标签之间相互作用的统计模型,该数据库是使用音调和中断索引(ToBI)韵律注释系统进行注释的数据库。本文提供了理论和经验结果,以支持以下结论:与韵律无关的语音比标准的与韵律无关的语音,与韵律相关的语音识别器(可同时计算最可能的单词标签和韵律标签的识别器)可提供更低的单词识别错误率无线电语音的多说话者相关语音识别任务中的语音识别器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号