首页> 外文会议>Ninth International Conference on Information Knowledge Management CIKM 2000 November 6-11, 2000 McLean, VA >Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets
【24h】

Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets

机译:基于向量逼近的非均匀高维数据集索引

获取原文
获取原文并翻译 | 示例

摘要

With the proliferation of multimedia data, there is increasing need to support the indexing and searching of high dimensional data. Recently, a vector approximation based technique called VA-file has been proposed for indexing high dimensional data. It has been shown that the VA-file is an effective technique compared to the current approaches based on space and data partitioning. The VA-file is an effective technique compared to the current approaches based on space and data partitioning. The VA-file gives good performance especially when the data set is uniformly distributed. Real data sets are not uniformly distributed, are often clustered, and the dimensions of the feature vectors in real data sets are usually correlated. More careful analysis for non-uniform or correlated data is needed for effectively indexing high dimensional data. We propose a solution to these problems and propose the VA~+-file, a new technique for indexing high dimensional data sets based on vector approximations. We conclude with an evalaution of nearest neighbor queries and show that the VA~+-file technique results in significant improvements over the current VA-file approach for several real data sets.
机译:随着多媒体数据的激增,越来越需要支持索引和搜索高维数据。近来,已经提出了一种基于矢量近似的称为VA文件的技术,用于索引高维数据。已经表明,与基于空间和数据分区的当前方法相比,VA文件是一种有效的技术。与基于空间和数据分区的当前方法相比,VA文件是一种有效的技术。 VA文件可提供良好的性能,尤其是当数据集均匀分布时。真实数据集不是均匀分布的,通常是聚类的,并且真实数据集中特征向量的维数通常是相关的。为了有效索引高维数据,需要对非均匀或相关数据进行更仔细的分析。我们提出了这些问题的解决方案,并提出了VA〜+文件,这是一种基于向量逼近对高维数据集建立索引的新技术。我们以对最近邻居查询的回避作为结论,并表明VA〜+文件技术相对于当前的VA文件方法在几个真实数据集上产生了重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号