首页> 外文期刊>International journal of computer science and network security >Query Processing and Optimization in Distributed Database Systems
【24h】

Query Processing and Optimization in Distributed Database Systems

机译:分布式数据库系统中的查询处理和优化

获取原文
获取原文并翻译 | 示例
           

摘要

Query processing is an important concern in the field of distributed databases. The main problem is: if a query can be decomposed into subqueries that require operations at geographically separated databases, determine the sequence and the sites for performing this set of operations such that the operating cost (communication cost and processing cost) for processing this query is minimized. The problem is complicated by the fact that query processing not only depends on the operations of the query, but also on the parameter values associated with the query. Distributed query processing is an important factor in the overall performance of a distributed database system.rnQuery optimization is a difficult task in a distributed client/server environment as data location becomes a major factor. In order to optimize queries accurately, sufficient information must be available to determine which data access techniques are most effective (for example, table and column cardinality, organization information, and index availability). Optimization algorithms have an important impact on the performance of distributed query processing.rnIn this paper, we describe the distributed query optimization problem in detail. We then present a (ARRQ) technique to process queries with a minimum quantity of intersite data transfer. The technique can be used to process the query where all of the relations referenced by a query are non-fragmented but distributed in different sites. The proposed technique is used to determine which relations are to be partitioned into fragments, and where the fragments are to be sent for processing. The technique is efficient compared to other techniques, as it generally chooses more than one relation to remain fragmented which exploits parallelism, while replicating the other relations (excluding the fragmented relations) to the sites of the fragmented relations. Thus the communication costs and local processing costs can be reduced due to the reduced size of the fragmented relations and the response time of queries can be improved.
机译:查询处理是分布式数据库领域中的一个重要问题。主要问题是:如果一个查询可以分解为需要在地理上分开的数据库中进行操作的子查询,请确定执行这组操作的顺序和位置,以便处理该查询的操作成本(通信成本和处理成本)为最小化。由于查询处理不仅取决于查询的操作,还取决于与查询关联的参数值,因此使问题变得复杂。分布式查询处理是影响分布式数据库系统整体性能的重要因素。由于数据位置成为主要因素,在分布式客户端/服务器环境中,查询优化是一项艰巨的任务。为了准确优化查询,必须有足够的信息来确定哪种数据访问技术最有效(例如,表和列的基数,组织信息和索引可用性)。优化算法对分布式查询处理的性能有重要影响。本文详细描述了分布式查询优化问题。然后,我们提出一种(ARRQ)技术,以最少的站点间数据传输量处理查询。该技术可用于处理查询,在该查询中,查询引用的所有关系都是无碎片的,但分布在不同的站点中。所提出的技术用于确定将哪些关系划分为片段,以及将片段发送到哪里进行处理。与其他技术相比,该技术是有效的,因为它通常选择一个以上的关系来保留利用并行性的碎片,同时将其他关系(不包括碎片关系)复制到碎片关系的位置。因此,由于碎片关系的大小减小,因此可以降低通信成本和本地处理成本,并且可以改善查询的响应时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号