...
首页> 外文期刊>The VLDB journal >Approximate similarity search for online multimedia services on distributed CPU-GPU platforms
【24h】

Approximate similarity search for online multimedia services on distributed CPU-GPU platforms

机译:分布式CPU-GPU平台上在线多媒体服务的近似相似度搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Similarity search in high-dimensional spaces is a pivotal operation for several database applications, including online content-based multimedia services. With the increasing popularity of multimedia applications, these services are facing new challenges regarding (1) the very large and growing volumes of data to be indexed/searched and (2) the necessity of reducing the response times as observed by end-users. In addition, the nature of the interactions between users and online services creates fluctuating query request rates throughout execution, which requires a similarity search engine to adapt to better use the computation platform and minimize response times. In this work, we address these challenges with Hypercurves, a flexible framework for answering approximate k-nearest neighbor (kNN) queries for very large multimedia databases. Hypercurves executes in hybrid CPU-GPU environments and is able to attain massive query-processing rates through the cooperative use of these devices. Hypercurves also changes its CPU-GPU task partitioning dynamically according to the observed load, aiming for optimal response times. In our empirical evaluation, dynamic task partitioning reduced query response times by approximately 50% compared to the best static task partition. Due to a probabilistic proof of equivalence to the sequential kNN algorithm, the CPU-GPU execution of Hypercurves in distributed (multi-node) environments can be aggressively optimized, attaining superlinear scalability while still guaranteeing, with high probability, results at least as good as those from the sequential algorithm.
机译:高维空间中的相似性搜索是一些数据库应用程序(包括基于在线内容的多媒体服务)的关键操作。随着多媒体应用程序的日益普及,这些服务面临以下新挑战:(1)要索引/搜索的数据量非常大且不断增长;(2)减少最终用户观察到的响应时间的必要性。此外,用户和在线服务之间交互的性质会在整个执行过程中产生波动的查询请求率,这需要相似性搜索引擎来更好地使用计算平台并最小化响应时间。在这项工作中,我们使用Hypercurves解决了这些挑战,Hypercurves是一种灵活的框架,用于回答大型多媒体数据库的近似k最近邻(kNN)查询。 Hypercurves在混合CPU-GPU环境中执行,并且能够通过协同使用这些设备来获得大量的查询处理速率。 Hypercurves还根据观察到的负载动态更改其CPU-GPU任务分区,以实现最佳响应时间。在我们的经验评估中,与最佳静态任务分区相比,动态任务分区将查询响应时间减少了约50%。由于具有与顺序kNN算法等效的概率证明,因此可以积极优化分布式(多节点)环境中Hypercurves的CPU-GPU执行,获得超线性可扩展性,同时仍以很高的概率保证结果至少与那些来自顺序算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号