Automatic Speech Recognition of Co-Channel Speech: Integrated Speaker and Speech Recognition Approach


Abstract

This paper presents a novel Bayesian approach to the problem of co-channel speech. The problem is formulated as the joint maximization of the a posteriori probability of the word sequence and the target speaker given the observed speech signal. It is shown that the joint probability can be expressed as the product of six terms: a likelihood score from a speaker-independent speech recognizer, the (normalized) likelihood score of a speaker recognizer, the likelihood of a sequence of prosodic events, the likelihood of a speaker-dependent statistical language model, a prior representing the channel usage patterns of a speaker, and the prior probability of the speaker. An efficient single-pass Viterbi search strategy is presented. Experimental results on over-the-telephone recognition of co-channel speech show a 45% reduction in word error rate of a 10-digit telephone number task.
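The six-term product described in the abstract becomes a sum when each term is taken in the log domain. The following is a minimal sketch of that scoring idea, not the authors' implementation: the `Hypothesis` class, its field names, and all numeric scores are hypothetical and chosen only for illustration. It picks the best (word sequence, speaker) pair by exhaustive comparison over a short candidate list, which stands in for the single-pass Viterbi search that the paper itself uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    """One candidate (word sequence, speaker) pair with six log-domain scores.

    All field names are illustrative assumptions; the abstract names the six
    terms but does not give symbols or an implementation.
    """
    words: str
    speaker: str
    log_asr: float            # speaker-independent speech recognizer likelihood
    log_speaker: float        # normalized speaker recognizer likelihood
    log_prosody: float        # likelihood of the sequence of prosodic events
    log_lm: float             # speaker-dependent statistical language model
    log_channel_prior: float  # prior on the speaker's channel usage pattern
    log_speaker_prior: float  # prior probability of the speaker

    def joint_log_score(self) -> float:
        # A product of six probabilities is a sum of six log-probabilities.
        return (self.log_asr + self.log_speaker + self.log_prosody
                + self.log_lm + self.log_channel_prior + self.log_speaker_prior)


def best_hypothesis(hypotheses):
    """Return the hypothesis that maximizes the joint log score."""
    return max(hypotheses, key=Hypothesis.joint_log_score)


# Made-up scores: the second hypothesis has the better speech recognizer
# score on its own, but loses once the other five terms are included.
hypotheses = [
    Hypothesis("5551234", "target", -10.0, -1.0, -2.0, -3.0, -0.5, -0.7),
    Hypothesis("5559234", "interferer", -9.5, -4.0, -2.5, -5.0, -1.5, -1.6),
]
best = best_hypothesis(hypotheses)
print(best.words, best.speaker, round(best.joint_log_score(), 2))
```

In a real decoder these scores would be accumulated frame by frame during search rather than computed once per complete hypothesis; the sketch only shows how the six terms combine into a single joint objective.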


Bibliographic details

Source: International Conference on Spoken Language Processing; October 4-8, 2004; Jeju (KR) | 2004 | pp. 829-832 | 4 pages
Conference location: Jeju (KR)
Authors: Larry P. Heck; Mark Z. Mao
Author affiliation: Nuance Communications, Menlo Park, CA, USA
Original format: PDF
Language: English
Chinese Library Classification: Applied linguistics

