Vocabulary-Independent Indexing of Spontaneous Speech

Yu P.; Chen K.; Ma C.; Seide F.

Abstract

We present a system for vocabulary-independent indexing of spontaneous speech, i.e., we neither know the vocabulary of a speech recording nor can we predict which query terms a user is going to search for. The technique can be applied to information retrieval, information extraction, and data mining. Our specific target is search in recorded conversations in the office/information-worker scenario: teleconferences, meetings, presentations, and voice mails. The focus of this paper is on how to index phonetic lattices. We will show that an index should provide expected term frequencies (ETFs) of query terms. Since, at indexing time, it is unknown which phoneme sequences constitute valid query terms, we will introduce an approximation of the ETF of a query's phoneme sequence by $M$-gram phoneme language models, which are estimated on lattices and organized in an inverted index-like structure for fast access. We will discuss ranking, estimation, and integration of phoneme/word hybrid approaches. Compared with an unindexed baseline without approximation, our approximation leads to only a 3.4% relative loss of search accuracy on the Linguistic Data Consortium (LDC) voicemail task. We also propose a two-stage method for locating individual keyword occurrences, using the above method as a fast match. A 20-times speedup over unindexed search is achieved at an accuracy loss of under 2 points. Last, we will briefly introduce a prototype applet based on the above techniques.
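To make the idea of approximating ETFs with $M$-gram phoneme statistics more concrete, here is a minimal sketch. It is not the authors' implementation, and several things in it are assumptions: each recording's lattice is simplified to a list of `(posterior, phoneme_sequence)` paths, the helper names `expected_ngram_counts`, `build_index`, and `approx_etf` are hypothetical, and the chain-rule factorization is one plausible reading of the abstract, assuming `max_order >= 2`.

```python
from collections import defaultdict

def expected_ngram_counts(paths, max_order):
    """Expected counts of all phoneme n-grams up to max_order.

    Assumption: the lattice is given as weighted paths
    [(posterior, (phoneme, ...)), ...] whose posteriors sum to 1.
    """
    counts = defaultdict(float)
    for posterior, phones in paths:
        for order in range(1, max_order + 1):
            for i in range(len(phones) - order + 1):
                counts[tuple(phones[i:i + order])] += posterior
    return counts

def build_index(docs, max_order):
    """Inverted index-like structure: n-gram -> {doc_id: expected count}."""
    index = defaultdict(dict)
    for doc_id, paths in docs.items():
        for ngram, count in expected_ngram_counts(paths, max_order).items():
            index[ngram][doc_id] = count
    return index

def approx_etf(index, doc_id, query, max_order):
    """Approximate the ETF of a query phoneme sequence in one document.

    Queries no longer than max_order are looked up directly. Longer
    queries are factored with an M-gram chain rule: the count of the
    first M-gram times, for each further phoneme, the ratio
    count(M-gram) / count(its (M-1)-gram history).
    """
    query = tuple(query)

    def count(ngram):
        return index.get(ngram, {}).get(doc_id, 0.0)

    if len(query) <= max_order:
        return count(query)
    etf = count(query[:max_order])
    for i in range(1, len(query) - max_order + 1):
        history = count(query[i:i + max_order - 1])
        if history == 0.0:
            return 0.0
        etf *= count(query[i:i + max_order]) / history
    return etf

# Toy data: one recording with two competing lattice paths.
docs = {"vm1": [(0.6, ("h", "eh", "l", "ow")), (0.4, ("y", "eh", "l", "ow"))]}
index = build_index(docs, max_order=2)
print(approx_etf(index, "vm1", ("h", "eh", "l", "ow"), max_order=2))
```

Only statistics up to order M are stored, so the index can be built before any query term is known. The price is that the factorization can assign a nonzero ETF to a phoneme sequence that no single lattice path contains.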

Bibliographic details

Source
IEEE Transactions on Speech and Audio Processing | 2005, No. 5 | pp. 635-643 | 9 pages
Authors
Yu P.; Chen K.; Ma C.; Seide F.
Original format: PDF
Language: English
Chinese Library Classification: Electroacoustic technology and speech signal processing
Keywords
indexing; information retrieval; speech processing; vocabulary; M-gram phoneme language models; data mining; expected term frequencies; information extraction; inverted index-like structure; phoneme sequences; phonetic lattices; speech recordi

