CSVD: clustering and singular value decomposition for approximate similarity search in high-dimensional spaces

Castelli V.; Thomasian A.; Chung-Sheng Li

Abstract

Nearest-neighbor search of high-dimensionality spaces is critical for many applications, such as content-based retrieval from multimedia databases, similarity search of patterns in data mining, and nearest-neighbor classification. Unfortunately, even with the aid of the commonly used indexing schemes, the performance of nearest-neighbor (NN) queries deteriorates rapidly with the number of dimensions. We propose a method, called Clustering with Singular Value Decomposition (CSVD), which supports efficient approximate processing of NN queries, while maintaining good precision-recall characteristics. CSVD groups homogeneous points into clusters and separately reduces the dimensionality of each cluster using SVD. Cluster selection for NN queries relies on a branch-and-bound algorithm and within-cluster searches can be performed with traditional or in-memory indexing methods. Experiments with texture vectors extracted from satellite images show that CSVD achieves significantly higher dimensionality reduction than plain SVD for the same normalized mean squared error (NMSE), which translates into a higher efficiency in processing approximate NN queries.
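The two core steps named in the abstract, clustering followed by a separate SVD per cluster, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes plain Lloyd's k-means for the clustering step, a fixed number p of retained dimensions per cluster, and NMSE taken as the summed squared reconstruction error divided by the summed squared distance of the points to the global mean. The function names (kmeans, svd_reconstruct, csvd_reconstruct, nmse) and the synthetic data are hypothetical. Branch-and-bound cluster selection and within-cluster indexing are not shown.

```python
import numpy as np


def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means (an assumed stand-in for the clustering step)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct data points chosen at random.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # Squared distance from every point to every centroid.
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return labels


def svd_reconstruct(X, p):
    """Project X onto its top-p principal directions and map back."""
    mu = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    basis = Vt[:p]  # rows are the retained directions
    return mu + (X - mu) @ basis.T @ basis


def csvd_reconstruct(X, k, p):
    """Cluster first, then reduce each cluster separately with SVD."""
    labels = kmeans(X, k)
    X_hat = np.empty_like(X)
    for j in np.unique(labels):
        mask = labels == j
        X_hat[mask] = svd_reconstruct(X[mask], p)
    return X_hat


def nmse(X, X_hat):
    """Assumed definition: squared error over total scatter about the mean."""
    return ((X - X_hat) ** 2).sum() / ((X - X.mean(axis=0)) ** 2).sum()


# Hypothetical data: two groups, each close to its own 2-D subspace of R^8.
rng = np.random.default_rng(0)
d, p = 8, 2
parts = []
for offset in (0.0, 10.0):
    basis = rng.normal(size=(2, d))
    coeffs = rng.normal(size=(200, 2))
    parts.append(offset + coeffs @ basis + 0.05 * rng.normal(size=(200, d)))
X = np.vstack(parts)

print("NMSE, global SVD:", nmse(X, svd_reconstruct(X, p)))
print("NMSE, CSVD      :", nmse(X, csvd_reconstruct(X, k=2, p=p)))
```

For any fixed partition and the same p, the per-cluster reconstruction error cannot exceed that of a single global SVD, because each cluster's own principal subspace is the best-fitting p-dimensional affine subspace for its points. The sketch ignores the extra cost of storing one basis and one centroid per cluster, which a fair comparison would have to count.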

Bibliographic details

Source
IEEE Transactions on Knowledge and Data Engineering, 2003, no. 3, pp. 671-685 (15 pages)
Authors
Castelli V.; Thomasian A.; Chung-Sheng Li
Affiliation
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Indexing information
Original format: PDF
Language of text: English
Chinese Library Classification: radio electronics; telecommunications technology
Keywords
pattern clustering; singular value decomposition; query processing; multimedia databases; tree searching; database indexing; mean square error methods; data mining; CSVD; clustering; singular value decomposition; approximate similarity search; high-d;

Similar literature

1. SPY-TEC: An efficient indexing method for similarity search in high-dimensional data spaces [J]. Dong-Ho Lee, Hyoung-Joo Kim. Data & Knowledge Engineering. 2000, no. 1
2. Subspace Clustering for High-Dimensional Data Using Cluster Structure Similarity [J]. Kavan Fatehi, Mohsen Rezvani, Mansoor Fateh. International Journal of Intelligent Information Technologies. 2018, no. 3
3. Region Proximity in Metric Spaces and Its Use for Approximate Similarity Search [J]. Giuseppe Amato, Fausto Rabitti, Pasquale Savino. ACM Transactions on Information Systems. 2003, no. 2
4. CSVD: approximate similarity searches in high-dimensional spaces using clustering and singular value decomposition [C]. Alexander Thomasian, IBM Thomas J. Watson Research Ctr., Yorktown Heights. Conference on multimedia storage and archiving systems. 1998
5. Efficient similarity search in high-dimensional data spaces [D]. Li, Yue. 2004
6. Convex hulls in hamming space enable efficient search for similarity and clustering of genomic sequences [O]. David S. Campo, Yury Khudyakov. 2020
7. Clustering for approximate similarity search in high-dimensional spaces [O]. Chen Li, Edward Chang, Hector Garcia-Molina. 2002
