Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection

Haiyang LI; Tieran ZHENG; Guibin ZHENG; Jiqing HAN

首页> 外文期刊>IEICE transactions on information and systems >Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection

【24h】

Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection

机译：基于上下文一致性的词汇出现概率和主题自适应度的置信度测度

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a novel confidence measure to improve the performance of spoken term detection (STD). The proposed confidence measure is based on the context consistency between a hypothesized word and its context in a word lattice. The main contribution of this paper is to compute the context consistency by considering the uncertainty in the results of speech recognition and the effect of topic. To measure the uncertainty of the context, we employ the word occurrence probability, which is obtained through combining the overlapping hypotheses in a word posterior lattice. To handle the effect of topic, we propose a method of topic adaptation. The adaptation method firstly classifies the spoken document according to the topics and then computes the context consistency of the hypothesized word with the topic-specific measure of semantic similarity. Additionally, we apply the topic-specific measure of semantic similarity by two means, and they are performed respectively with the information of the top-1 topic and the mixture of all topics according to topic classification. The experiments conducted on the Hub-4NE Mandarin database show that both the occurrence probability of context word and the topic adaptation are effective for the confidence measure of STD. The proposed confidence measure performs better compared with the one ignoring the uncertainty of the context or the one using a non-topic method.

机译：在本文中，我们提出了一种新颖的置信度度量来改善口语术语检测（STD）的性能。拟议的置信度度量基于假设词与词格中其上下文之间的上下文一致性。本文的主要贡献是通过考虑语音识别结果的不确定性和主题效果来计算上下文一致性。为了测量上下文的不确定性，我们使用单词出现概率，该概率是通过将单词后格子中的重叠假设进行组合而获得的。为了处理主题的影响，我们提出了一种主题自适应的方法。自适应方法首先根据主题对语音文档进行分类，然后使用特定主题的语义相似性度量来计算假设单词的上下文一致性。此外，我们通过两种方式应用特定于主题的语义相似性度量，并根据主题分类分别对top-1主题的信息和所有主题的混合进行执行。在Hub-4NE普通话数据库上进行的实验表明，上下文词的出现概率和主题适应对STD的置信度均有效。与忽略上下文不确定性的方法或使用非主题方法的方法相比，建议的置信度方法的效果更好。

著录项

来源
《IEICE transactions on information and systems》 |2014年第3期|共8页
作者
Haiyang LI; Tieran ZHENG; Guibin ZHENG; Jiqing HAN;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Multi-source Knowledge Fusion Strategy to Improve Confidence Measure in a Lattice-based Spoken Term Detection System [J] . Xinglong Gao, Jielin Pan, Yonghong Yan Journal of information and computational science . 2014,第11期

机译：基于格的口语检测系统中提高置信度的多源知识融合策略
2. Topic Mining based on Word Posterior Probability in Spoken Document [J] . Lei Zhang, Guo-xing Chen, Xue-zhi Xiang, Journal of software . 2011,第11期

机译：语音文档中基于词后验概率的主题挖掘
3. Topic Mining based on Word Posterior Probability in Spoken Document [J] . Lei Zhang, Guo-xing Chen, Xue-zhi Xiang, Journal of Computers . 2011,第11期

机译：语音文档中基于词后验概率的主题挖掘
4. A Novel Confidence Measure Based on Context Consistency for Spoken Term Detection [C] . Haiyang Li, Jiqing Han, Tieran Zheng, Annual conference of the International Speech Communication Association . 2012

机译：基于上下文一致性的语音术语检测置信度新方法
5. Adaptation and Augmentation: Towards Better Rescoring Strategies for Automatic Speech Recognition and Spoken Term Detection [D] . Ma, Min. 2018

机译：适应和增强：寻求更好的自动语音识别和语音术语检测的评分策略
6. The Two Sides of Linguistic Context: Eye-Tracking as a Measure of Semantic Competition in Spoken Word Recognition Among Younger and Older Adults [O] . Nicolai D. Ayasse, Arthur Wingfield 2020

机译：语言上下文的两个方面：眼动追踪作为一种衡量年轻人和老年人口语识别中语义竞争的手段
7. Posterior-based confidence measures for spoken term detection [O] . Wang Dong, Tejedor Javier, Frankel Joe, 2009

机译：基于后验的置信度用于语音术语检测
8. Recognition Confidence Measures: Detection of Misrecognitions and Out-Of-Vocabulary Words [R] . Young, S. R. 1994

机译：识别信心度量：检测误识别和词汇外单词

Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection

摘要

著录项

相似文献

相关主题

期刊订阅