Approximate similarity search for online multimedia services on distributed CPU-GPU platforms

George Teodoro; Eduardo Valle; Nathan Mariano; Ricardo Torres; Wagner Meira Jr; Joel H. Saltz


Abstract

Similarity search in high-dimensional spaces is a pivotal operation for several database applications, including online content-based multimedia services. With the increasing popularity of multimedia applications, these services are facing new challenges regarding (1) the very large and growing volumes of data to be indexed/searched and (2) the necessity of reducing the response times as observed by end-users. In addition, the nature of the interactions between users and online services creates fluctuating query request rates throughout execution, which requires a similarity search engine to adapt to better use the computation platform and minimize response times. In this work, we address these challenges with Hypercurves, a flexible framework for answering approximate k-nearest neighbor (kNN) queries for very large multimedia databases. Hypercurves executes in hybrid CPU-GPU environments and is able to attain massive query-processing rates through the cooperative use of these devices. Hypercurves also changes its CPU-GPU task partitioning dynamically according to the observed load, aiming for optimal response times. In our empirical evaluation, dynamic task partitioning reduced query response times by approximately 50% compared to the best static task partition. Due to a probabilistic proof of equivalence to the sequential kNN algorithm, the CPU-GPU execution of Hypercurves in distributed (multi-node) environments can be aggressively optimized, attaining superlinear scalability while still guaranteeing, with high probability, results at least as good as those from the sequential algorithm.
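The multi-node setting mentioned in the abstract can be illustrated with a minimal sketch. This is not the Hypercurves algorithm, whose internals the abstract does not describe. It uses exact brute-force search only to show why merging per-node top-k lists yields the same answer as a sequential kNN over the whole dataset. The function names (`sq_dist`, `local_knn`, `distributed_knn`) and the data layout (points as tuples, one list per node) are assumptions made for illustration.

```python
import heapq


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def local_knn(partition, query, k):
    """Exact k nearest neighbors of `query` inside one partition (one node).

    Returns a list of (distance, point) pairs sorted by increasing distance.
    """
    return heapq.nsmallest(k, ((sq_dist(p, query), p) for p in partition))


def distributed_knn(partitions, query, k):
    """Hypothetical coordinator: gather each node's local top-k, then reduce.

    Every true global neighbor is among the top-k of the partition that
    holds it, so the merged candidate set always contains the global answer.
    """
    candidates = []
    for partition in partitions:
        candidates.extend(local_knn(partition, query, k))
    return heapq.nsmallest(k, candidates)
```

In this sketch each partition is searched in a plain loop; in a real deployment the `local_knn` calls would run concurrently on separate nodes or devices, and an approximate index would replace the brute-force scan.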


Bibliographic record

Source
The VLDB Journal | 2014, Issue 3 | pp. 427-448 | 22 pages
Authors
George Teodoro; Eduardo Valle; Nathan Mariano; Ricardo Torres; Wagner Meira Jr; Joel H. Saltz
Author affiliations

Center for Comprehensive Informatics, Emory University, Atlanta, GA, USA;

Recod Lab/DCA/FEEC, State University of Campinas, Campinas, SP, Brazil;

Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil;

Recod Lab/DSI/IC, State University of Campinas, Campinas, SP, Brazil;

Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil;

Center for Comprehensive Informatics, Emory University, Atlanta, GA, USA;

Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language: English
Keywords
Descriptor indexing; Multimedia databases; Information retrieval; Hypercurves; Filter-stream; GPGPU;


