首页> 外文会议>IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 >Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder
【24h】

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder

机译:带有连接主义时间分类的语音术语检测:一种新型的混合CTC-DBN解码器

获取原文

摘要

This paper proposes a novel system for robust keyword detection in continuous speech. Our decoder is composed of a bidirectional Long Short-Term Memory recurrent neural network using a Connectionist Temporal Classification (CTC) output layer, and a Dynamic Bayesian Network (DBN). The CTC network exploits bidirectional context information to reliably identify phonemes, whereas the DBN is able to discriminate between keywords and arbitrary speech while explicitly modeling substitutions, deletions, and insertions in the CTC phoneme output string. Our technique is vocabulary independent and does not require an explicit garbage model. Experiments show that our system architecture prevails over a standard Hidden Markov Model approach.
机译:本文提出了一种用于连续语音中鲁棒关键词检测的新颖系统。我们的解码器由使用连接器时间分类(CTC)输出层的双向长期短期记忆递归神经网络和动态贝叶斯网络(DBN)组成。 CTC网络利用双向上下文信息来可靠地识别音素,而DBN能够区分关键字和任意语音,同时在CTC音素输出字符串中显式建模替换,删除和插入。我们的技术与词汇无关,不需要显式的垃圾模型。实验表明,我们的系统架构胜于标准的隐马尔可夫模型方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号