首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Keyword search using query expansion for graph-based rescoring of hypothesized detections
【24h】

Keyword search using query expansion for graph-based rescoring of hypothesized detections

机译:使用查询扩展的关键字搜索,对假设的检测结果进行基于图的记录

获取原文

摘要

In this work, we propose a novel framework for rescoring keyword search (KWS) detections using acoustic samples extracted from the training data. We view the keyword rescoring task as an information retrieval task and adopt the idea of query expansion. We expand a textual keyword with multiple speech keyword samples extracted from the training data. In this way, the hypothesized detections are compared with the multiple keywords using non-parametric approaches such as dynamic time warping (DTW). The obtained similarity scores are used in a graph based method to re-rank the original confidence scores estimated by the automatic speech recognition (ASR) systems. Experimental results on the NIST OpenKWS15 Evaluation show that our rescoring method is effective, especially for the subword system. For subword experiments, the graph-based rescoring with training samples obtains 5.1% and 1.5% absolute improvement over two baseline systems. One is a standard parametric ASR system, while the other is the graph-based rescoring without training samples.
机译:在这项工作中,我们提出了一种用于使用从训练数据中提取的声学样本来重新调用关键字搜索(KWS)检测的新颖框架。我们将关键字rescoring任务视为信息检索任务,并采用查询扩展的想法。我们展开了从训练数据中提取的多个语音关键字样本的文本关键字。以这种方式,使用诸如动态时间翘曲(DTW)的非参数方法(DTW)将假设检测与多个关键字进行比较。所获得的相似度分数以基于曲线图的方法使用,以重新排名由自动语音识别(ASR)系统估计的原始置信区。 NIST OpenKWS15评估的实验结果表明,我们的救援方法是有效的,特别是对于子字系统。对于次字实验,具有培训样本的基于图形的重生获得了两个基线系统的5.1%和1.5%的绝对改进。一个是标准参数ASR系统,而另一个是基于图形的Rescoring而不进行训练样本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号