【24h】

Evaluation of Fast Spoken Term Detection Using a Suffix Array

机译:使用后缀数组的快速语音术语检测评估

获取原文

摘要

We previously proposed [1] fast spoken term detection that uses a suffix array as a data structure for searching a large-scale speech documents. In this method, a keyword is divided into sub-keywords, and the phoneme sequences that contain two or more sub-keywords are output as results. Although the search is executed very quickly on a 10,000-h speech database, we only proposed a variety of matching procedures in [1]. In this paper, we compare different varieties of matching procedures in which the number of phonemes in a sub-keyword and the required number of sub-keywords to be contained in a search result are different. We also compare the performance and the process time of our method with typical spoken term detection using an inverted index.
机译:我们先前提出了[1]快速口语术语检测,该方法使用后缀数组作为用于搜索大规模语音文档的数据结构。在该方法中,将关键字划分为子关键字,并且将包含两个或多个子关键字的音素序列作为结果输出。尽管在10,000小时的语音数据库中可以非常快速地执行搜索,但我们在[1]中仅提出了各种匹配过程。在本文中,我们比较了各种匹配过程,其中子关键字中的音素数量与搜索结果中包含的子关键字所需数量不同。我们还将使用倒排索引将我们的方法的性能和处理时间与典型的语音术语检测进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号