首页> 外文会议>International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) >Automatic Speech Recognition of Co-Channel Speech: Integrated Speaker and Speech Recognition Approach
【24h】

Automatic Speech Recognition of Co-Channel Speech: Integrated Speaker and Speech Recognition Approach

机译:同频道语音的自动语音识别:演讲者和语音识别的集成方法

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a novel Bayesian approach to the problem of co-channel speech. The problem is formulated as the joint maximization of the a posteriori probability of the word sequence and the target speaker given the observed speech signal. It is shown that the joint probability can be expressed as the product of six terms: a likelihood score from a speaker-independent speech recognizer, the (normalized) likelihood score of a speaker recognizer, the likelihood of a sequence of prosodic events, the likelihood of a speaker-dependent statistical language model, a prior representing the channel usage patterns of a speaker, and the prior probability of the speaker. An efficient single-pass Viterbi search strategy is presented. Experimental results on over-the-telephone recognition of co-channel speech show a 45% reduction in word error rate of a 10-digit telephone number task.
机译:本文提出了一种新颖的贝叶斯方法来解决同频道语音问题。该问题被表述为给定观察到的语音信号时,单词序列和目标说话人的后验概率的联合最大化。结果表明,联合概率可以表示为六个项的乘积:来自独立于说话者的语音识别器的似然评分,来自说话者识别器的(规范化)似然评分,一系列韵律事件的似然,取决于说话者的统计语言模型的代表,代表说话者的频道使用模式的先验和说话者的先验概率。提出了一种有效的单遍维特比搜索策略。通过电话识别同频道语音的实验结果表明,一个10位数电话号码任务的单词错误率降低了45%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号