首页> 外文学位 >Query optimization in distributed database systems and multidatabase systems.
【24h】

Query optimization in distributed database systems and multidatabase systems.

机译:分布式数据库系统和多数据库系统中的查询优化。

获取原文
获取原文并翻译 | 示例

摘要

Distributed query processing algorithms usually perform data reduction by using a semijoin program, but the problem with these approaches is that they still require an explicit join of the reduced relations in the final phase. We introduce an efficient algorithm for join processing in distributed database systems that makes use of bipartite graphs in order to reduce data communication costs and local processing costs. The bipartite graphs represent the tuples that can be joined in two relations taking into account also the reduction state of the relations. This algorithm fully reduces the relations at each site. We then present an adaptive algorithm for response time optimization that takes into account the system configuration, i.e., the additional resources available and the data characteristics in order to select the best strategy for response time minimization. We also report on the results of a set of experiments that show that our algorithms outperform a number of the recently proposed methods for total processing time and response time minimization.; In multidatabase systems, we address the problem of data integration and query processing in the presence of conflicting values for the attributes that correspond to different instances of the same real-world object appearing in multiple roles. We present a new method for schema and data integration in multidatabase systems that presents the answer to the query as a set of sets representing the distinct intersections between the relations representing the various roles of interest. We present extensions to MSQL that allow us to express role-sets and quantifiers applied to them. We further develop a strategy for query processing based on our role-based model. Our method allows for local selection for all queries with/without aggregation and thus reduces the amount of data transmitted. In addition, it makes efficient use of merge-sort techniques to generate the role-sets and compute joins at the global query site.
机译:分布式查询处理算法通常通过使用半联接程序执行数据约简,但是这些方法的问题在于,它们仍然需要在最后阶段显式联接已缩减的关系。我们引入了一种高效的分布式数据库系统中联接处理的算法,该算法利用二部图来降低数据通信成本和本地处理成本。二部图表示可以在两个关系中合并的元组,同时还考虑了关系的简化状态。该算法充分减少了每个站点的关系。然后,我们提出了一种用于响应时间优化的自适应算法,该算法考虑了系统配置(即可用的额外资源和数据特征),以选择最佳的响应时间最小化策略。我们还报告了一组实验的结果,这些结果表明我们的算法在总处理时间和最小化响应时间方面优于最近提出的许多方法。在多数据库系统中,当存在与多个角色中出现的同一真实世界对象的不同实例对应的属性存在冲突值时,我们解决了数据集成和查询处理的问题。我们提出了一种用于多数据库系统中的模式和数据集成的新方法,该方法将查询的答案显示为一组代表关系的不同交集的集合,这些交集代表了所关注的各种角色。我们介绍了MSQL的扩展,使我们可以表达应用于它们的角色集和量词。我们基于基于角色的模型进一步开发了用于查询处理的策略。我们的方法允许对所有具有或不具有聚合的查询进行本地选择,从而减少了传输的数据量。此外,它可以有效地利用合并排序技术来生成角色集并在全局查询站点上计算联接。

著录项

  • 作者

    Chong, Eugene Inseok.;

  • 作者单位

    Northwestern University.;

  • 授予单位 Northwestern University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1994
  • 页码 137 p.
  • 总页数 137
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号