Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

机译：词汇外口语检测的直接后验置信度

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Spoken term detection (STD) is a fundamental task in spoken information retrieval. Compared to conventional speech transcription and keyword spotting, STD is an open-vocabulary task and is necessarily required to address out-of-vocabul-ary (OOV) terms. Approaches based on subword units, e.g. phonemes, are widely used to solve the OOV issue; however, performance on OOV terms is still significantly inferior to that for in-vocabulary (INV) terms. The performance degradation on OOV terms can be attributed to a multitude of factors. A particular factor we address in this paper is that the acoustic and language models used for speech transcribing are highly vulnerable to OOV terms, which leads to unreliable confidence measures and error-prone detections. A direct posterior confidence measure that is derived from discriminative models has been proposed for STD. In this paper, we utilize this technique to tackle the weakness of OOV terms in confidence estimation. Neither acoustic models nor language models being included in the computation, the new confidence avoids the weak modeling problem with OOV terms. Our experiments, set up on multi-party meeting speech which is highly spontaneous and conversational, demonstrate that the proposed technique improves STD performance on OOV terms significantly; when combined with conventional lattice-based confidence, a significant improvement in performance is obtained on both INVs and OOVs. Furthermore, the new confidence measure technique can be combined together with other advanced techniques for OOV treatment, such as stochastic pronunciation modeling and term-dependent confidence discrimination, which leads to an integrated solution for OOV STD with greatly improved performance.

机译：语音术语检测（STD）是语音信息检索中的一项基本任务。与常规语音转录和关键字查找相比，STD是一项公开的任务，并且必须解决词汇外（OOV）术语。基于子词单位的方法，例如音素，被广泛用于解决OOV问题；但是，OOV术语的性能仍然明显低于词汇内（INV）术语的性能。 OOV术语的性能下降可归因于多种因素。我们在本文中解决的一个特殊因素是，用于语音转录的声学和语言模型极易受到OOV术语的影响，这会导致不可靠的置信度度量和易于出错的检测。从判别模型导出的直接后置置信度量度已被建议用于STD。在本文中，我们利用该技术来解决置信度估计中OOV项的弱点。计算中既没有声学模型也没有语言模型，新的置信度避免了OOV术语的弱建模问题。我们的实验基于高度自发和对话的多方会议演讲，证明了所提出的技术可以显着提高OOD条件下的性病表现；当与常规的基于格的置信度结合使用时，INV和OOV的性能都将得到显着改善。此外，新的置信度测量技术可以与其他用于OOV处理的高级技术结合使用，例如随机发音建模和与术语相关的置信度判别，这导致OOV STD的集成解决方案性能大大提高。

著录项

来源
《ACM workshop on searching spontaneous conversational speech 2010》|2010年|p.21-26|共6页
会议地点 Firenze(IT);Firenze(IT)
作者
Dong Wang; Simon King; Nicholas Evans; Joe Frankel; Raphael Troncy;
展开▼
作者单位

Eurecom BP 193, F-06904 Sophia Antiplis, France;

CSTR, University of Edinburgh 10 Crichton Street, EH8 9AB Edinburgh, UK;

Eurecom BP 193, F-06904 Sophia Antiplis, France;

CSTR, University of Edinburgh 10 Crichton Street, EH8 9AB Edinburgh, UK;

Eurecom BP193, F-06904 Sophia Antiplis, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
spoken term detection; speech document search; spontaneous conversational speech; speech recognition;

机译：语音术语检测；语音文件搜索；自发的对话讲话；语音识别;
入库时间 2022-08-26 14:19:11

相似文献

外文文献
中文文献
专利

1. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection [J] . DONG WANG, SIMON KING, JOE FRANKEL, ACM Transactions on Information Systems . 2012,第3期

机译：词汇外口语检测的直接后验置信度
2. Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection [J] . Dong Wang, Javier Tejedor, Simon King, 计算机科学技术学报（英文版） . 2012,第002期

机译：词外词汇检测的词条依赖置信度归一化
3. Stochastic Pronunciation Modeling for Out-of-Vocabulary Spoken Term Detection [J] . Wang D.King S.Frankel J. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第4期

机译：词汇外语音检测的随机语音建模
4. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection [C] . Dong Wang, Simon King, Nicholas Evans, ACM workshop on searching spontaneous conversational speech . 2010

机译：直接后置于词汇术语检测
5. Adaptation and Augmentation: Towards Better Rescoring Strategies for Automatic Speech Recognition and Spoken Term Detection [D] . Ma, Min. 2018

机译：适应和增强：寻求更好的自动语音识别和语音术语检测的评分策略
6. Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems [O] . Oleg Akhtiamov, Ingo Siegert, Alexey Karpov, 2020

机译：使用复杂度相同的人机对话来调查口语对话系统的收件人检测
7. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection [O] . Nicholas Evans, Sophia Antiplis, Dong Wang, 2012

机译：词汇外口语检测的直接后置置信度
8. Recognition Confidence Measures: Detection of Misrecognitions and Out-Of-Vocabulary Words [R] . Young, S. R. 1994

机译：识别信心度量：检测误识别和词汇外单词

Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

摘要

著录项

相似文献

相关主题

期刊订阅