Exploiting Discriminative Point Process Models for Spoken Term Detection

机译：利用判别点过程模型进行口语检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

State-of-the-art spoken term detection (STD) systems are built on top of large vocabulary speech recognition engines, which generate lattices that encode candidate occurrences of each in-vocabulary query. These lattices specifiy start and stop times of hypothesized term occurrences, providing a clear opportunity to return to the acoustics to incorporate novel confidence measures for verification. In this paper, we introduce a novel exemplar distance metric to the recently proposed discriminative point process modeling (DPPM) framework and use the resulting whole word models to generate STD confidence scores. In doing so, we introduce STD to a completely distinct acoustic modeling pipeline, trading Gaussian mixture models (GMM) for multi-layer perceptrons and replacing dictionary-derived hidden Markov models (HMM) with exemplar-based point process models. We find that whole word DPPM scores both perform comparably and are complementary to lattice posterior scores produced by a state-of-the-art speech recognition engine.

机译：最先进的语音术语检测（STD）系统建立在大型词汇语音识别引擎的基础上，该引擎生成可对每个语音查询中的候选出现进行编码的格。这些晶格指定了假设词项出现的开始和停止时间，从而提供了返回声学的明确机会，以纳入新颖的置信度度量进行验证。在本文中，我们向最近提出的判别点过程建模（DPPM）框架引入了一种新颖的示例性距离度量，并使用所得的整个单词模型来生成STD置信度得分。为此，我们将STD引入了一个完全不同的声学建模管道，将高斯混合模型（GMM）换为多层感知器，并用基于示例的点过程模型替换了字典派生的隐马尔可夫模型（HMM）。我们发现，整个单词DPPM分数的表现均相当，并且与最新的语音识别引擎产生的晶格后验分数互补。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2441-2444|共4页
会议地点
作者
Atta Norouzian; Aren Jansen; Richard Rose; Samuel Thomas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
spoken term detection; point process model; dis-criminative training; whole word model;

机译：语音术语检测;点过程模型;歧视性培训;全字模型;

相似文献

外文文献
中文文献
专利

1. Feature analysis for discriminative confidence estimation in spoken term detection [J] . Javier Tejedor, Doroteo T. Toledano, Dong Wang, Computer speech and language . 2014,第5期

机译：语音术语检测中的判别置信度估计的特征分析
2. Evolutionary discriminative confidence estimation for spoken term detection [J] . Javier Tejedor, Alejandro Echeverria, Dong Wang, Multimedia Tools and Applications . 2013,第1期

机译：语音项检测的进化判别置信估计
3. Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection [J] . Wallace R., Baker B., Vogt R., Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第6期

机译：语音口语检测的优值判别式优化
4. Exploiting Discriminative Point Process Models for Spoken Term Detection [C] . Atta Norouzian, Aren Jansen, Richard Rose, INTERSPEECH 2012 . 2012

机译：利用说明术语检测的判别点过程模型
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D] . Prabhavalkar, Rohit. 2013

机译：基于区分性发音特征的语音模型及其在口语检测中的应用
6. The cortical organization of lexical knowledge: A dual lexicon model of spoken language processing [O] . David W. Gow Jr. -1

机译：词汇知识的皮质组织：一种口语语言处理的双exicon模型
7. DISCRIMINATIVE ARTICULATORY MODELS FOR SPOKEN TERM DETECTION IN LOW-RESOURCE CONVERSATIONAL SETTINGS [O] . Rohit Prabhavalkar, Karen Livescu, Eric Fosler-lussier, 2014

机译：低资源对流设置中尖端检测的判别指示模型

Exploiting Discriminative Point Process Models for Spoken Term Detection

摘要

著录项

相似文献

相关主题

期刊订阅