首页> 外文学位 >Improving the performance of heterogeneous databases and agents.
【24h】

Improving the performance of heterogeneous databases and agents.

机译:改善异构数据库和代理的性能。

获取原文
获取原文并翻译 | 示例

摘要

Today's applications require the ability to access and query multiple heterogeneous data sources. Heterogeneous databases (HDB) and heterogeneous agent systems provide the necessary means to attain this goal. A common characteristics of such HDB and agent systems is that they simultaneously process large numbers of queries/requests. The ability to efficiently handle large volumes of simultaneous queries is critical in many such applications.; In this thesis, we present various query optimization techniques to improve the performance of such heavily loaded HDB and agent systems. Since cost models play a key role in query optimization, we first propose a framework through which a heterogeneous system can obtain cost and cardinality information required for optimization. Our approach is the first to adapt a traditional System-R style optimizer to perform costing in a heterogeneous environment.; Another important problem in a heterogeneous system deployed over wide area networks, such as the Internet, is the duplication of data and services. Since different sources have varying characteristics, choosing the right set of sources to answer a user query has critical performance implications. In this thesis, we first formalize this problem as the source selection problem and show that it is NP-hard. We then propose one optimal and two heuristic based algorithms to address source selection problem.; As the next step to optimize the performance of HDB and agent systems, we propose a set of cost-based multiple query optimization (MQO) algorithms which exploit commonalities between multiple queries submitted to such systems.; Finally, as an HDB or an agent system may have a vast number of pending queries and MQO algorithms can handle relatively smaller sets of queries, the final step in optimizing the performance of such systems is to group a large number of queries into classes of manageable size, so that each class can be optimized by the MQO techniques we developed earlier (or by third party MQO techniques). In this thesis, we formalize this partitioning problem and show that it is NP-hard. We then provide two exact and several heuristic based algorithms for query partitioning.
机译:当今的应用程序需要能够访问和查询多个异构数据源。异构数据库(HDB)和异构代理系统提供了实现此目标的必要手段。这种HDB和代理系统的共同特征是它们同时处理大量查询/请求。在许多此类应用程序中,有效处理大量同时查询的能力至关重要。在本文中,我们提出了各种查询优化技术,以提高此类重载HDB和代理系统的性能。由于成本模型在查询优化中起着关键作用,因此我们首先提出一个框架,异构系统可以通过该框架获取优化所需的成本和基数信息。我们的方法是率先采用传统的System-R样式优化器以在异构环境中执行成本计算的。在广域网(例如Internet)上部署的异构系统中的另一个重要问题是数据和服务的重复。由于不同的来源具有不同的特性,因此选择正确的来源集来回答用户查询具有关键的性能影响。在本文中,我们首先将此问题形式化为源选择问题,并证明它是NP难的。然后,我们提出一种基于最优算法和两种基于启发式算法的算法来解决源选择问题。作为优化HDB和代理系统性能的下一步,我们提出了一套基于成本的多查询优化(MQO)算法,该算法利用了提交给此类系统的多个查询之间的共性。最后,由于HDB或代理系统可能具有大量待处理的查询,而MQO算法可以处理相对较少的查询集,因此优化此类系统性能的最后一步是将大量查询分组为可管理的类大小,以便可以通过我们之前开发的MQO技术(或通过第三方MQO技术)优化每个类。在本文中,我们对这个划分问题进行了形式化证明,证明它是NP难的。然后,我们为查询分区提供了两种基于精确和几种启发式的算法。

著录项

  • 作者

    Ozcan, Fatma.;

  • 作者单位

    University of Maryland College Park.;

  • 授予单位 University of Maryland College Park.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 265 p.
  • 总页数 265
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号