Confidence measure for speech indexing based on Latent Dirichlet Allocation

机译：基于潜在狄利克雷分配的语音索引置信度度量

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a confidence measure for speech indexing that aims to predict the indexing quality of a speech document for a Spoken Document Retrieval (SDR) task. We first introduce how the indexing quality of a speech document is evaluated. Then, we present our method to predict the indexing quality of a speech document. It is based on confidence measure provided by an automatic speech recognition system and the detection of semantic outliers implemented with the Latent Dirichlet Allocation (LDA) model. Experiments are conducted on the French Broadcast news campaign ESTER2 in a classical SDR scenario where users submit text-queries to a search engine. Results demonstrate an overall improvement when the detection is done with the LDA model. The detection rate is always above 70%.

机译：本文提出了一种语音索引的置信度度量，旨在预测语音文档检索（SDR）任务的语音文档的索引质量。我们首先介绍如何评估语音文档的索引质量。然后，我们提出了预测语音文档索引质量的方法。它基于自动语音识别系统提供的置信度度量以及使用潜在狄利克雷分配（LDA）模型实现的语义离群值检测。在传统的SDR场景中，法国广播新闻活动ESTER2进行了实验，用户将文本查询提交给搜索引擎。当使用LDA模型进行检测时，结果证明了整体改进。检出率始终高于70％。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2299-2302|共4页
会议地点
作者
Gregory Senay; Georges Linares;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
speech indexing; confidence measure; spo-ken document retrieval; latent dirichlet allocation;

机译：语音索引置信度口头文件检索;潜在狄利克雷分配;

相似文献

外文文献
中文文献
专利

1. Document Similarity Measure Based on the Earth Mover's Distance Utilizing Latent Dirichlet Allocation [J] . Min-Hee Jang, Tae-Hwan Eom, Sang-Wook Kim, Research journal of applied science, engineering and technology . 2016,第2期

机译：利用潜在狄利克雷分配的基于地球移动器距离的文档相似性度量
2. Document Similarity Measure Based on the Earth Mover's Distance Utilizing Latent Dirichlet Allocation [J] . Min-Hee Jang, Tae-Hwan Eom, Sang-Wook Kim, Research journal of applied science, engineering and technology . 2016,第2期

机译：利用潜在狄利克雷分配的基于地球移动器距离的文档相似性度量
3. Indexing by Latent Dirichlet Allocation and an Ensemble Model [J] . Yanshan Wang, Jae-Sung Lee, In-Chan Choi Journal of the American Society for Information Science and Technology . 2016,第7期

机译：通过潜在Dirichlet分配和集成模型建立索引
4. Confidence measure for speech indexing based on Latent Dirichlet Allocation [C] . Grégory Senay, Georges Linarès INTERSPEECH 2012 . 2012

机译：基于潜在Dirichlet分配的语音索引信心措施
5. Comparing latent Dirichlet allocation and latent semantic analysis as classifiers [D] . Anaya, Leticia H. 2011

机译：比较潜在Dirichlet分配和潜在语义分析作为分类器
6. Latent Dirichlet allocation model for world trade analysis [O] . Diego Kozlowski, Viktoriya Semeshenko, Andrea Molinari 2021

机译：世界贸易分析潜在的Dirichlet分配模型
7. Analysis of latent Dirichlet allocation and non-negative matrix factorization using latent semantic indexing [O] . Saqib et al. 2019

机译：利用潜在语义索引分析潜在的Dirichlet分配和非负矩阵分解

Confidence measure for speech indexing based on Latent Dirichlet Allocation

摘要

著录项

相似文献

相关主题

期刊订阅