首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >CSVD: clustering and singular value decomposition for approximate similarity search in high-dimensional spaces
【24h】

CSVD: clustering and singular value decomposition for approximate similarity search in high-dimensional spaces

机译:CSVD:聚类和奇异值分解,用于在高维空间中进行近似相似性搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Nearest-neighbor search of high-dimensionality spaces is critical for many applications, such as content-based retrieval from multimedia databases, similarity search of patterns in data mining, and nearest-neighbor classification. Unfortunately, even with the aid of the commonly used indexing schemes, the performance of nearest-neighbor (NN) queries deteriorates rapidly with the number of dimensions. We propose a method, called Clustering with Singular Value Decomposition (CSVD), which supports efficient approximate processing of NN queries, while maintaining good precision-recall characteristics. CSVD groups homogeneous points into clusters and separately reduces the dimensionality of each cluster using SVD. Cluster selection for NN queries relies on a branch-and-bound algorithm and within-cluster searches can be performed with traditional or in-memory indexing methods. Experiments with texture vectors extracted from satellite images show that CSVD achieves significantly higher dimensionality reduction than plain SVD for the same normalized mean squared error (NMSE), which translates into a higher efficiency in processing approximate NN queries.
机译:高维空间的最近邻居搜索对于许多应用程序至关重要,例如从多媒体数据库中基于内容的检索,数据挖掘中模式的相似性搜索以及最近邻居分类。不幸的是,即使借助常用的索引方案,最近邻(NN)查询的性能也会随着维数的增加而迅速恶化。我们提出了一种称为奇异值分解聚类(CSVD)的方法,该方法支持对NN查询进行有效的近似处理,同时保持良好的精度调用特性。 CSVD将齐次点分组为簇,并使用SVD分别降低每个簇的维数。 NN查询的群集选择依赖于分支定界算法,并且可以使用传统或内存中索引方法来执行群集内搜索。使用从卫星图像提取的纹理矢量进行的实验表明,对于相同的归一化均方误差(NMSE),CSVD的降维效果明显高于普通SVD,从而在处理近似NN查询时转化为更高的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号