Journal: Information Sciences: An International Journal

A fast hierarchical search algorithm for discriminative keyword spotting



Abstract

A keyword spotter can be viewed as a binary classifier that sorts a set of uttered sentences into two groups according to whether they contain the target keywords. For this classification task, the keyword spotter must locate the target keywords using a fast and accurate search algorithm. In our previous work, we used a modified Viterbi (M-Viterbi) search algorithm, which has two known drawbacks. First, to locate the target keywords, it runs an exhaustive search through all possible segments of the input speech. Second, while computing the start and end time frames of each new phone, it forces the keyword spotter to trace back and re-evaluate the timing alignments of all previous phones, even though very little data, if any, is updated as a result. These two pitfalls dramatically enlarge the search space and significantly increase the computational complexity. In this paper, we propose a Hierarchical Search (H-Search) algorithm that allows the system to ignore some segments of the input speech at each level of the hierarchy, according to their lower likelihood of containing the target keywords. In addition, unlike the M-Viterbi algorithm, the H-Search algorithm does not require repeated evaluations when computing the current phone alignment. Compared with the M-Viterbi algorithm, this yields a narrower search space (O(TP) versus O(TP·Lmax), where T is the number of frames, P is the number of keyword phones, and Lmax is the maximum phone duration) and a lower computational complexity (O(TP·Lmax) versus O(TP·Lmax^3)). We applied the H-Search algorithm to the classification part of an Evolutionary Discriminative Keyword Spotting (EDKWS) system introduced in our previous work. The experimental results indicate that the H-Search algorithm runs 100 times faster than the M-Viterbi algorithm, while the performance of the EDKWS system degrades by no more than two percent relative to its performance with the M-Viterbi algorithm. (C) 2015 Elsevier Inc. All rights reserved.
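
To make the quoted orders of growth concrete, the following minimal sketch evaluates the dominant terms for one set of sizes. The function name `growth_terms` and the values T=300, P=6, Lmax=10 are hypothetical and chosen only for illustration. Constant factors are ignored, so the ratios show how the bounds scale and are not a prediction of measured run time.

```python
def growth_terms(T, P, L_max):
    """Dominant growth terms quoted in the abstract, with constants ignored.

    T: number of frames, P: number of keyword phones,
    L_max: maximum phone duration (in frames).
    """
    return {
        "h_search_space": T * P,               # O(TP)
        "m_viterbi_space": T * P * L_max,      # O(TP*Lmax)
        "h_search_ops": T * P * L_max,         # O(TP*Lmax)
        "m_viterbi_ops": T * P * L_max ** 3,   # O(TP*Lmax^3)
    }


# Hypothetical sizes, not taken from the paper.
terms = growth_terms(T=300, P=6, L_max=10)

# The search-space bound differs by a factor of Lmax,
# the complexity bound by a factor of Lmax squared.
space_ratio = terms["m_viterbi_space"] // terms["h_search_space"]
ops_ratio = terms["m_viterbi_ops"] // terms["h_search_ops"]
print(space_ratio, ops_ratio)
```

For any choice of T and P, the two ratios reduce to Lmax and Lmax^2 respectively, so the gap between the two algorithms' bounds depends only on the maximum phone duration.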
