首页> 外文学位 >Run-time information fusion in large vocabulary continuous speech recognition.
【24h】

Run-time information fusion in large vocabulary continuous speech recognition.

机译:大词汇量连续语音识别中的运行时信息融合。

获取原文
获取原文并翻译 | 示例

摘要

Continuous speech recognition systems are environmentally sensitive and suffer from the great variability of speech. In order to achieve recognition robustness, there's a strong interest among researchers on how to fuse different information sources for speech recognition. A common problem of those approaches is that complementary information is lost either before or after recognition.; To avoid this unrecoverable information loss, and to better utilize this complementary information, we proposed a run time information fusion scheme. The hypothesis of this thesis is that by performing fusion at different levels and stages of a Large Vocabulary Continuous Speech Recognition (LVCSR) system, especially inside the decoder, more reliable and efficient fusion is possible.; The hypothesis is first tested in a speech segmentation task, which is essential to the performance of an LVCSR system. Furthermore, three different approaches of run time fusion are proposed and implemented inside an LVCSR decoder. The experiments demonstrate the effectiveness and potential of these approaches.
机译:连续语音识别系统对环境敏感,并且遭受语音的巨大变化。为了实现识别的鲁棒性,研究人员对如何融合不同的信息源进行语音识别有着浓厚的兴趣。这些方法的一个普遍问题是互补信息在识别之前或之后都会丢失。为了避免这种不可恢复的信息丢失,并更好地利用此补充信息,我们提出了一种运行时信息融合方案。本文的假设是,通过在大型词汇连续语音识别(LVCSR)系统的不同级别和阶段执行融合,尤其是在解码器内部,可以实现更可靠,更有效的融合。该假设首先在语音分割任务中进行测试,这对于LVCSR系统的性能至关重要。此外,在LVCSR解码器内部提出并实现了三种不同的运行时融合方法。实验证明了这些方法的有效性和潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号