Journal: International Journal of Speech Technology

An efficient lattice-based phonetic search method for accelerating keyword spotting in large speech databases


Abstract

This paper describes an algorithm for reducing the computational complexity of phonetic-search keyword spotting (KWS). The reduction matters most when searching very large speech databases with a rapid response time as the goal. The proposed algorithm performs an anchor-based phoneme search: it shrinks the search space by generating hypotheses only around phonemes recognized with high reliability. Three databases were used for the evaluation: IBM Voicemail I and Voicemail II, which consist of long spontaneous utterances, and the Wall Street Journal portion of the MACRO-PHONE database, which consists of read speech utterances. The results indicate a reduction of nearly 90% in the computational complexity of the search, together with an improved false alarm rate and only a small decrease in the detection rate in both databases. The search space reduction, as well as the resulting performance gain or loss, can be controlled according to user preferences via the algorithm's parameters and thresholds.
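
The anchor idea in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a 1-best phoneme string rather than a lattice, a hypothetical per-phoneme confidence in [0, 1], and a toy match-fraction score. All names (`PhoneHyp`, `anchored_candidates`, `search`) and both thresholds are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PhoneHyp:
    phone: str
    confidence: float  # assumed reliability score in [0, 1]; not from the paper


def anchored_candidates(hyps, keyword, anchor_threshold=0.9):
    """Return keyword start positions implied by high-confidence anchors.

    A recognized phoneme is an anchor if its confidence reaches the threshold.
    Each anchor that equals the k-th keyword phoneme proposes the alignment
    that starts k positions earlier.
    """
    starts = set()
    last_start = len(hyps) - len(keyword)
    for i, hyp in enumerate(hyps):
        if hyp.confidence < anchor_threshold:
            continue  # low-reliability phonemes generate no hypotheses
        for k, kw_phone in enumerate(keyword):
            start = i - k
            if kw_phone == hyp.phone and 0 <= start <= last_start:
                starts.add(start)
    return sorted(starts)


def match_score(hyps, keyword, start):
    """Toy score: fraction of keyword phonemes matched at this alignment."""
    hits = sum(1 for k, p in enumerate(keyword) if hyps[start + k].phone == p)
    return hits / len(keyword)


def search(hyps, keyword, anchor_threshold=0.9, accept_threshold=0.75):
    """Score only anchored alignments; also report the exhaustive count."""
    candidates = anchored_candidates(hyps, keyword, anchor_threshold)
    detections = [s for s in candidates
                  if match_score(hyps, keyword, s) >= accept_threshold]
    exhaustive = max(len(hyps) - len(keyword) + 1, 0)
    return detections, len(candidates), exhaustive


# Hypothetical recognizer output for illustration only.
phones = "dh ax k ae t s ae t aa n dh ax m ae t".split()
confs = [0.5, 0.4, 0.95, 0.6, 0.7, 0.5, 0.5, 0.6,
         0.3, 0.95, 0.5, 0.4, 0.6, 0.92, 0.5]
hyps = [PhoneHyp(p, c) for p, c in zip(phones, confs)]

detections, scored, exhaustive = search(hyps, ["k", "ae", "t"])
print(detections, scored, exhaustive)  # [2] 2 13
```

In this toy input only 2 of the 13 possible alignments are scored. Raising `anchor_threshold` prunes more alignments at the risk of missing true occurrences, which mirrors the trade-off between search space reduction and detection rate that the abstract says is controllable through parameters and thresholds.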