Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets

机译：基于向量逼近的非均匀高维数据集索引

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the proliferation of multimedia data, there is increasing need to support the indexing and searching of high dimensional data. Recently, a vector approximation based technique called VA-file has been proposed for indexing high dimensional data. It has been shown that the VA-file is an effective technique compared to the current approaches based on space and data partitioning. The VA-file is an effective technique compared to the current approaches based on space and data partitioning. The VA-file gives good performance especially when the data set is uniformly distributed. Real data sets are not uniformly distributed, are often clustered, and the dimensions of the feature vectors in real data sets are usually correlated. More careful analysis for non-uniform or correlated data is needed for effectively indexing high dimensional data. We propose a solution to these problems and propose the VA~+-file, a new technique for indexing high dimensional data sets based on vector approximations. We conclude with an evalaution of nearest neighbor queries and show that the VA~+-file technique results in significant improvements over the current VA-file approach for several real data sets.

机译：随着多媒体数据的激增，越来越需要支持索引和搜索高维数据。近来，已经提出了一种基于矢量近似的称为VA文件的技术，用于索引高维数据。已经表明，与基于空间和数据分区的当前方法相比，VA文件是一种有效的技术。与基于空间和数据分区的当前方法相比，VA文件是一种有效的技术。 VA文件可提供良好的性能，尤其是当数据集均匀分布时。真实数据集不是均匀分布的，通常是聚类的，并且真实数据集中特征向量的维数通常是相关的。为了有效索引高维数据，需要对非均匀或相关数据进行更仔细的分析。我们提出了这些问题的解决方案，并提出了VA〜+文件，这是一种基于向量逼近对高维数据集建立索引的新技术。我们以对最近邻居查询的回避作为结论，并表明VA〜+文件技术相对于当前的VA文件方法在几个真实数据集上产生了重大改进。

著录项

来源
《Ninth International Conference on Information Knowledge Management CIKM 2000 November 6-11, 2000 McLean, VA》|2000年|p.202-209|共8页
会议地点 McLean VA(US);McLean VA(US)
作者
Hakan Ferhatosmanoglu; Ertem Tuncel; Divyakant Agrawal; Amr El Abbadi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Spatial indexing of high-dimensional data based on relative approximation [J] . Yasushi Sakurai, Masatoshi Yoshikawa, Shunsuke Uemura, The International Journal of Very Large Data Bases . 2002,第2期

机译：基于相对逼近的高维数据空间索引
2. An Indexing Technique Using Relative Approximation for High-Dimensional Data [J] . Yasushi Sakurai 1 Masatoshi Yoshikawa, Shunsuke Uemura, Haruhiko Kojima Systems and Computers in Japan . 2003,第12期

机译：使用相对逼近的高维数据索引技术
3. Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set [J] . SunT., WangJ., LiX., Computer Methods and Programs in Biomedicine: An International Journal Devoted to the Development, Implementation and Exchange of Computing Methodology and Software Systems in Biomedical Research and Medical Practice . 2013,第2期

机译：基于多维数据集的计算机辅助诊断CT诊断肺癌的支持向量机的比较评估
4. Vector approximation based indexing for non-uniform high dimensional data sets [C] . Hakan Ferhatosmanoglu, Ertem Tuncel, Divyakant Agrawal, International conference on Information and knowledge management . 2000

机译：基于向量逼近的非均匀高维数据集索引
5. Efficient indexing and retrieval of colour image data using a vector -based approach. [D] . Androutsos, Dimitrios. 1999

机译：使用基于矢量的方法对彩色图像数据进行高效索引和检索。
6. Fast Nonparametric Density-Based Clustering of Large Data Sets Using a Stochastic Approximation Mean-Shift Algorithm [O] . Ollivier Hyrien, Andrea Baran -1

机译：使用随机逼近均值漂移算法的大型数据集基于非参数密度的快速聚类
7. Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets [O] . Hakan Ferhatosmanoglu, Ertem Tuncel, Divyakant Agrawal, 2000

机译：基于向量逼近的非均匀高维数据集索引
8. Gravity Vector Estimatiton from a Large and Densely Spaced Heterogeneous Gradient Data Set Using Closed-Form Kernel Approximations [R] . Jekeli, C. 1985

机译：基于闭模核近似的大且密集空间非均匀梯度数据集的重力矢量估计

Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets

摘要

著录项

相似文献

相关主题

期刊订阅