首页> 外文会议>International conference on management of data >Effective Data Co-Reduction for Multimedia Similarity Search
【24h】

Effective Data Co-Reduction for Multimedia Similarity Search

机译:多媒体相似搜索的有效数据共约减少

获取原文
获取外文期刊封面目录资料

摘要

Multimedia similarity search has been playing a critical role in many novel applications. Typically, multimedia objects are described by high-dimensional feature vectors (or points) which are organized in databases for retrieval. Although many high-dimensional indexing methods have been proposed to facilitate the search process, efficient retrieval over large, sparse and extremely high-dimensional databases remains challenging due to the continuous increases in data size and feature dimensionality. In this paper, we propose the first framework for Data Co-Reduction (DCR) on both data size and feature dimensionality. By utilizing recently developed co-clustering methods, DCR simultaneously reduces both size and dimensionality of the original data into a compact subspace, where lower bounds of the actual distances in the original space can be efficiently established to achieve fast and lossless similarity search in the filter-and-refine approach. Particularly, DCR considers the duality between size and dimensionality, and achieves the optimal co-reduction which generates the least number of candidates for actual distance computations. We conduct an extensive experimental study on large and real-life multimedia datasets. with dimensionality ranging from 432 to 1936. Our results demonstrate that DCR outperforms existing methods significantly for lossless retrieval, especially in the presence of extremely high dimensionality.
机译:多媒体相似性搜索在许多新颖的应用程序中一直扮演着至关重要的角色。通常,多媒体对象是通过在数据库中组织以供检索的高维特征向量(或点)来描述的。尽管已经提出了许多高维索引方法来简化搜索过程,但是由于数据大小和特征维的不断增加,在大型,稀疏和超高维数据库上进行有效的检索仍然具有挑战性。在本文中,我们针对数据大小和特征维数提出了第一个数据共约化(DCR)框架。通过使用最新开发的共聚方法,DCR同时将原始数据的大小和维数减小到一个紧凑的子空间中,在该子空间中,可以有效地建立原始空间中实际距离的下限,从而在过滤器中实现快速且无损的相似性搜索和完善的方法。特别是,DCR考虑了尺寸和尺寸之间的对偶,并实现了最佳的共约简,该最优共约简生成了用于实际距离计算的最少候选数。我们对大型和现实的多媒体数据集进行了广泛的实验研究。维度范围从432到1936。我们的结果表明,对于无损检索,DCR的性能明显优于现有方法,尤其是在存在极高维度的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号