首页> 外文会议>IEEE International Conference on Distributed Computing Systems >Multi-query Optimization for Distributed Similarity Query Processing
【24h】

Multi-query Optimization for Distributed Similarity Query Processing

机译:分布式相似性查询处理的多查询优化

获取原文

摘要

This paper considers a multi-query optimization issue for distributed similarity query processing, which attempts to exploit the dependencies in the derivation of a query evaluation plan. To the best of our knowledge, this is the first work investigating a multi-query optimization technique for distributed similarity query processing (MDSQ). Four steps are incorporated in our MDSQ algorithm. First when a number of query requests (i.e., m query vectors and m radiuses) are simultaneously submitted by users, then a cost-based dynamic query scheduling (DQS) procedure is invoked to quickly and effectively identify the correlation among the query spheres (requests). After that, an index-based vector set reduction is performed at data node level in parallel. Finally, a refinement process of the candidate vectors is conducted to get the answer set. The proposed method includes a cost-based dynamic query scheduling, a Start-Distance (SD)-based load balancing scheme, and an index-based vector set reduction algorithm. The experimental results validate the efficiency and effectiveness of the algorithm in minimizing the response time and increasing the parallelism of I/O and CPU.
机译:本文考虑了用于分布式相似性查询处理的多查询优化问题,该问题试图利用查询评估计划的推导中的依赖项。据我们所知,这是第一个调查用于分布式相似性查询处理(MDSQ)的多查询优化技术的工作。我们的MDSQ算法中包含四个步骤。首先,当用户同时提交许多查询请求(即,M查询向量和M个半径)时,将调用基于成本的动态查询调度(DQS)过程以快速有效地识别查询球之间的相关性(请求)。之后,并行地在数据节点电平下执行基于索引的向量集。最后,进行候选矢量的细化过程以获取答案集。所提出的方法包括基于成本的动态查询调度,开始距离(SD)基于负载平衡方案和基于索引的向量集合减少算法。实验结果验证了算法在最小化响应时间并增加I / O和CPU的并行性时验证算法的效率和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号