首页> 外文会议>Annual conference of the International Speech Communication Association >Exploiting Discriminative Point Process Models for Spoken Term Detection
【24h】

Exploiting Discriminative Point Process Models for Spoken Term Detection

机译:利用判别点过程模型进行口语检测

获取原文

摘要

State-of-the-art spoken term detection (STD) systems are built on top of large vocabulary speech recognition engines, which generate lattices that encode candidate occurrences of each in-vocabulary query. These lattices specifiy start and stop times of hypothesized term occurrences, providing a clear opportunity to return to the acoustics to incorporate novel confidence measures for verification. In this paper, we introduce a novel exemplar distance metric to the recently proposed discriminative point process modeling (DPPM) framework and use the resulting whole word models to generate STD confidence scores. In doing so, we introduce STD to a completely distinct acoustic modeling pipeline, trading Gaussian mixture models (GMM) for multi-layer perceptrons and replacing dictionary-derived hidden Markov models (HMM) with exemplar-based point process models. We find that whole word DPPM scores both perform comparably and are complementary to lattice posterior scores produced by a state-of-the-art speech recognition engine.
机译:最先进的语音术语检测(STD)系统建立在大型词汇语音识别引擎的基础上,该引擎生成可对每个语音查询中的候选出现进行编码的格。这些晶格指定了假设词项出现的开始和停止时间,从而提供了返回声学的明确机会,以纳入新颖的置信度度量进行验证。在本文中,我们向最近提出的判别点过程建模(DPPM)框架引入了一种新颖的示例性距离度量,并使用所得的整个单词模型来生成STD置信度得分。为此,我们将STD引入了一个完全不同的声学建模管道,将高斯混合模型(GMM)换为多层感知器,并用基于示例的点过程模型替换了字典派生的隐马尔可夫模型(HMM)。我们发现,整个单词DPPM分数的表现均相当,并且与最新的语音识别引擎产生的晶格后验分数互补。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号