首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification
【24h】

Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification

机译:通过语音查询和IV / OOV分类,将基于音节的N-gram搜索和单词搜索相结合以检测语音术语

获取原文

摘要

This paper presents a Japanese spoken term detection method for spoken queries using a combination of word-based search and syllable-based N-gram search with in-vocabulary/out-of-vocabulary (IV/OOV) term classification. The N-gram index in a recognized syllable-based lattice for OOV terms, which assumes recognition errors such as substitution, insertion and deletion errors, incorporates a distance metric as a confidence score. To address spoken queries, we propose an automatic method for discriminating IV and OOV terms by using the confidence scores of spoken queries through large-vocabulary/syllable continuous speech recognition. Evaluation on an academic lecture presentation database with 44 hours of data shows that the combination of word search and syllable-based N-gram search yields significant improvement and outperforms the baseline syllable-based DTW approach.
机译:本文提出了一种日语语音术语检测方法,该方法结合了基于单词的搜索和基于音节的N-gram搜索以及词汇内/词汇外(IV / OOV)术语分类,从而进行了语音查询。假设识别错误(例如替换,插入和删除错误)的OOV术语基于已识别的基于音节的格中的N元语法索引结合了距离度量作为置信度得分。为了解决口头查询,我们提出了一种自动方法,该方法通过大词汇量/音节连续语音识别使用口头查询的置信度得分来区分IV和OOV术语。对具有44个小时数据的学术讲座演示数据库的评估表明,单词搜索和基于音节的N-gram搜索相结合可产生显着改善,并且优于基于基线的基于音节的DTW方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号