We aim to improve spoken term detection performance by incorporating contextual information beyond traditional N-gram language models. Rather than taking a broad view of topic context in spoken documents, the variability of word co-occurrence statistics across corpora leads us to focus instead on the phenomenon of word repetition within single documents. We show that, given the detection of one instance of a term, we are more likely to find additional instances of that term in the same document. We leverage this burstiness of keywords by taking the most confident keyword hypothesis in each document and interpolating its score with those of lower-scoring hits. We then develop a principled approach to selecting the interpolation weights using only the ASR training data. Using this re-weighting approach we demonstrate consistent improvement in term detection performance across all five languages in the BABEL program.
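As a rough illustration of the re-weighting step described above, the sketch below boosts lower-scoring hits of a term toward the most confident hit in the same document. The (keyword, document, score) tuple layout, the function name reweight_hits, and the fixed interpolation weight alpha are assumptions for illustration only; the paper's procedure for selecting weights from the ASR training data is not reproduced here.

    from collections import defaultdict

    def reweight_hits(hits, alpha=0.5):
        # hits: list of (keyword, doc_id, score) tuples (assumed layout).
        # alpha: placeholder interpolation weight; the paper selects it
        # from ASR training data rather than fixing it by hand.

        # Find the most confident hypothesis for each keyword in each document.
        best = defaultdict(float)
        for kw, doc, score in hits:
            best[(kw, doc)] = max(best[(kw, doc)], score)

        # Interpolate every lower-scoring hit with the document's best score.
        rescored = []
        for kw, doc, score in hits:
            top = best[(kw, doc)]
            new_score = score if score == top else (1 - alpha) * score + alpha * top
            rescored.append((kw, doc, new_score))
        return rescored

    # Toy usage: two hits of the same term in one document, one in another.
    hits = [("zambezi", "doc1", 0.92), ("zambezi", "doc1", 0.40), ("zambezi", "doc2", 0.35)]
    print(reweight_hits(hits, alpha=0.5))

Under this sketch, the weak hit in doc1 is pulled up toward 0.66 because a confident hit of the same term already occurs in that document, while the isolated hit in doc2 is left unchanged.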