首页> 外文期刊>Information Systems >A survey on graph-based methods for similarity searches in metric spaces
【24h】

A survey on graph-based methods for similarity searches in metric spaces

机译:基于图的相似性搜索方法的调查

获取原文
获取原文并翻译 | 示例
           

摘要

Technology development has accelerated the volume growth of complex data, such as images, videos, time series, and georeferenced data. Similarity search is a widely used approach to retrieve complex data, which aims at retrieving similar data according to intrinsic characteristics of the data. Therefore, to facilitate the retrieval of complex data using similarity searches, one needs to organize large collections of data in a way that similar data can be retrieved efficiently. Many access methods were proposed in the literature to speed up similarity data retrieval from large databases. Recently, graph-based methods have emerged as a very efficient alternative for similarity retrieval, with reports indicating those methods outperformed other non-graph-based methods in several scenarios. However, to the best of our knowledge, there is no previous work with experimental analysis on a comprehensive number of graph-based methods using the same search algorithm and execution environment. Our main contribution is a survey on graph-based methods used for similarity searches. We present a review on graph-based methods (types of graphs and search algorithms) as well as a detailed discussion on the applicability of search algorithms (with exact or approximate answers) in each graph type. Our main focus is on static methods in metric spaces. This survey also includes an experimental evaluation of representative graphs implemented in a common platform. We evaluate the relative performance behavior of these graphs concerning the main construction and query parameters for a variety of real-world datasets. We also show results using synthetic datasets evaluating the performance of different graph types according to different dataset features. Our experimental results reinforce the tradeoff between graph construction cost and search performance according to the construction and search parameters. (C) 2020 Elsevier Ltd. All rights reserved.
机译:技术开发加速了复杂数据的体积增长,如图像,视频,时间序列和地理参考数据。相似性搜索是一种广泛使用的方法来检索复杂数据,其目的在于根据数据的内在特征检索类似的数据。因此,为了便于使用相似性搜索检索复杂数据,需要以有效地检索类似数据的方式组织大量数据集合。在文献中提出了许多访问方法,以加快来自大型数据库的相似性数据检索。最近,基于图形的方法作为相似性检索的非常有效的替代方法,报告指示这些方法在几种情况下表现出基于其他非图形的方法。然而,据我们所知,使用相同的搜索算法和执行环境,没有以前的基于图形的基于图形方法的实验分析。我们的主要贡献是关于相似性搜索的基于图的方法的调查。我们对基于图形的方法(图表和搜索算法类型)进行了审查,以及关于搜索算法(具有精确或近似答案)在每个图形类型中的适用性的详细讨论。我们的主要重点是在公制空间中的静态方法。该调查还包括在共同平台中实施的代表性图表的实验评估。我们评估这些图表的相对性能行为,了关于各种现实数据集的主要构造和查询参数。我们还使用合成数据集根据不同的数据集功能来显示使用SyntheCate DataSets评估不同图类型的性能的结果。我们的实验结果根据施工和搜索参数加强了图形施工成本和搜索性能之间的权衡。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号