首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >SPEED IMPROVEMENTS TO INFORMATION RETRIEVAL-BASED DYNAMIC TIME WARPING USING HIERARCHICAL K-MEANS CLUSTERING
【24h】

SPEED IMPROVEMENTS TO INFORMATION RETRIEVAL-BASED DYNAMIC TIME WARPING USING HIERARCHICAL K-MEANS CLUSTERING

机译:使用分层K-means群集的信息检索基于动态时间扭曲的速度改进

获取原文

摘要

With the increase in multi-media data over the Internet, query by example spoken term detection (QbE-STD) has become important in providing a search mechanism to find spoken queries in spoken audio. Audio search algorithms should be efficient in terms of speed and memory to handle large audio files. In general, approaches derived from the well known dynamic time warping (DTW) algorithm suffer from scalability problems. To overcome such problems, an Information Retrieval-based DTW (IR-DTW) algorithm has been proposed recently. IR-DTW borrows techniques from Information Retrieval community to detect regions which are more likely to contain the spoken query and then uses a standard DTW to obtain exact start and end times. One drawback of the IR-DTW is the time taken for the retrieval of similar reference points for a given query point. In this paper we propose a method to improve the search performance of IR-DTW algorithm using a clustering based technique. The proposed method has shown an estimated speedup of 2400X.
机译:随着Internet的多媒体数据的增加,通过示例说明术语检测(Qbe-STD)的查询在提供搜索机制方面变得重要,以便在口语音频中找到口头查询。音频搜索算法应在速度和内存方面有效,以处理大音频文件。通常,从已知的动态时间翘曲(DTW)算法导出的方法遭受可扩展性问题。为了克服这些问题,最近提出了一种信息检索的DTW(IR-DTW)算法。 IR-DTW借助来自信息检索社区的技术来检测更有可能包含口头查询的区域,然后使用标准DTW来获得精确的开始和结束时间。 IR-DTW的一个缺点是对给定查询点的类似参考点的检索所花费的时间。在本文中,我们提出了一种使用基于聚类技术提高IR-DTW算法的搜索性能的方法。所提出的方法显示了2400倍的估计加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号