首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Segmental acoustic indexing for zero resource keyword search
【24h】

Segmental acoustic indexing for zero resource keyword search

机译:用于零资源关键字搜索的分段声学索引

获取原文

摘要

The task of zero resource query-by-example keyword search has received much attention in recent years as the speech technology needs of the developing world grow. These systems traditionally rely upon dynamic time warping (DTW) based retrieval algorithms with runtimes that are linear in the size of the search collection. As a result, their scalability substantially lags that of their supervised counterparts, which take advantage of efficient word-based indices. In this paper, we present a novel audio indexing approach called Segmental Randomized Acoustic Indexing and Logarithmic-time Search (S-RAILS). S-RAILS generalizes the original frame-based RAILS methodology to word-scale segments by exploiting a recently proposed acoustic segment embedding technique. By indexing word-scale segments directly, we avoid higher cost frame-based processing of RAILS while taking advantage of the improved lexical discrimination of the embeddings. Using the same conversational telephone speech benchmark, we demonstrate major improvements in both speed and accuracy over the original RAILS system.
机译:近年来,随着发展中国家语音技术需求的增长,以资源为例进行零资源关键词搜索的任务受到了广泛的关注。这些系统传统上依赖于基于动态时间规整(DTW)的检索算法,其运行时间在搜索集合的大小上是线性的。结果,它们的可伸缩性大大落后于受监督的同类文件,后者利用了有效的基于单词的索引。在本文中,我们提出了一种新颖的音频索引方法,称为分段随机声学索引和对数时间搜索(S-RAILS)。 S-RAILS通过利用最近提出的声学片段嵌入技术,将原始的基于帧的RAILS方法推广到词标片段。通过直接索引字标段,我们避免了使用成本更高的基于框架的RAILS处理,同时利用了改进的嵌入词法辨别力。使用相同的对话电话语音基准测试,我们证明了与原始RAILS系统相比,速度和准确性都有了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号