【24h】

AN EMPIRICAL STUDY OF CONFUSION MODELING IN KEYWORD SEARCH FOR LOW RESOURCE LANGUAGES

机译:在低资源语言中的关键词搜索中混淆建模的实证研究

获取原文
获取外文期刊封面目录资料

摘要

Keyword search, in the context of low resource languages, has emerged as a key area of research. The dominant approach in keyword search is to use Automatic Speech Recognition (ASR) as a front end to produce a representation of audio that can be indexed. The biggest drawback of this approach lies in its the inability to deal with out-of-vocabulary words and query terms that are not in the ASR system output. In this paper we present an empirical study evaluating various approaches based on using confusion models as query expansion techniques to address this problem. We present results across four languages using a range of confusion models which lead to significant improvements in keyword search performance as measured by the Maximum Term Weighted Value (MTWV) metric.
机译:关键字搜索,在低资源语言的背景下,已成为研究的关键领域。 关键字搜索中的主导方法是使用自动语音识别(ASR)作为前端,以产生可以索引的音频的表示。 这种方法的最大缺点在于它无法处理不在ASR系统输出中的词汇单词和查询术语。 在本文中,我们提出了一种基于使用混淆模型作为查询扩展技术来评估各种方法的实证研究,以解决这个问题。 我们使用一系列混淆模型呈现四种语言的结果,这导致了通过最大术语加权值(MTWV)度量来测量的关键字搜索性能的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号