首页> 外文会议>2010 IEEE International Conference on Acoustics Speech and Signal Processing >Fast similarity search on a large speech data set with neighborhood graph indexing
【24h】

Fast similarity search on a large speech data set with neighborhood graph indexing

机译:使用邻域图索引对大型语音数据集进行快速相似性搜索

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a novel graph-based approach for solving a problem of fast finding a speech model acoustically similar to a query model from a large set of speech models. Each speech model in the set is represented by a Gaussian mixture model and dissimilarity from a GMM to another is measured with a Kullback-Leibler divergence (KLD). Conventional pruning techniques based on the triangle inequality for fast similarity search are not available because the model space with a KLD is not a metric space. We propose a search method that is characterized by an index of a degree-reduced nearest neighbor (DRNN) graph. The search method can efficiently find the most similar (closest) GMM to a query, exploring the DRNN graph with a best-first manner. Experimental evaluations on utterance GMM search tasks reveal a significantly low computational cost of the proposed method.
机译:本文提出了一种新颖的基于图的方法,用于解决从大量语音模型中快速查找与查询模型在声学上相似的语音模型的问题。集合中的每个语音模型都由高斯混合模型表示,并且使用Kullback-Leibler散度(KLD)来衡量GMM与另一个GMM的差异。基于三角形不等式的用于快速相似性搜索的常规修剪技术不可用,因为带有KLD的模型空间不是度量空间。我们提出了一种搜索方法,该方法的特征在于降低度的最近邻(DRNN)图的索引。该搜索方法可以有效地找到与查询最相似(最接近)的GMM,并以最佳优先的方式探索DRNN图。对语音GMM搜索任务的实验评估表明,该方法的计算成本非常低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号