Conference: International Conference on Enterprise Information Systems

STATISTICS API: DBMS-INDEPENDENT ACCESS AND MANAGEMENT OF DBMS STATISTICS IN HETEROGENEOUS ENVIRONMENTS

Abstract

Many of today's applications access not a single database but a multitude of databases running on different DBMSs. Federation technology is used to integrate these databases and to offer users a single query interface through which they can run queries that access tables stored in different remote databases. The optimizer of the federated DBMS therefore has to decide which portion of a query should be processed by the federated DBMS itself and which portion should be executed at the remote systems. To do so, it has to retrieve cost estimates for query fragments from the remote databases. The responses of these databases typically contain cost and cardinality estimates but no statistics about the data stored in them. However, statistics are optimization-critical information and a crucial factor in the decisions made by the optimizer of the federated DBMS. When this information is not available, optimization has to rely on imprecise heuristics, mostly based on default selectivities. To fill this gap, we propose Statistics API, a Java interface that provides DBMS-independent access to statistics data stored in databases running on different DBMSs. Statistics API also defines the data structures used for the statistics data returned by or passed to the interface. We have implemented this interface for the three prevailing commercial DBMSs: IBM DB2, Oracle, and Microsoft SQL Server. These implementations are available under the terms of the GNU Lesser General Public License (LGPL). This paper introduces the interface, i.e., the methods and data structures of the Statistics API, and discusses some details of the three interface implementations.
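To make the idea concrete, the following is a minimal sketch of what a DBMS-independent statistics interface and its use in an optimizer might look like. It is not the actual Statistics API: the abstract does not list its methods or data structures, so every name here (`TableStats`, `StatsSource`, `InMemoryStatsSource`, `SelectivityEstimator`) is hypothetical, the in-memory class merely stands in for a vendor-specific implementation, and the default selectivity of 0.1 is an assumed placeholder for the kind of heuristic the abstract mentions.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical DBMS-neutral data structure: statistics for one table.
final class TableStats {
    final String table;
    final long rowCount;
    final Map<String, Long> distinctValues; // column name -> number of distinct values

    TableStats(String table, long rowCount, Map<String, Long> distinctValues) {
        this.table = table;
        this.rowCount = rowCount;
        // Defensive copy so callers cannot change the statistics afterwards.
        this.distinctValues = Collections.unmodifiableMap(new HashMap<>(distinctValues));
    }
}

// Hypothetical interface: one implementation per DBMS would translate these
// calls into that system's own catalog queries or statistics procedures.
interface StatsSource {
    Optional<TableStats> getTableStats(String table);
    void setTableStats(TableStats stats);
}

// In-memory stand-in for a vendor-specific implementation (illustration only).
final class InMemoryStatsSource implements StatsSource {
    private final Map<String, TableStats> byTable = new HashMap<>();

    @Override
    public Optional<TableStats> getTableStats(String table) {
        return Optional.ofNullable(byTable.get(table));
    }

    @Override
    public void setTableStats(TableStats stats) {
        byTable.put(stats.table, stats);
    }
}

// Shows why statistics matter: with them, the selectivity of "column = value"
// can be estimated as 1 / (distinct values); without them, the optimizer
// falls back to a fixed default.
final class SelectivityEstimator {
    static final double DEFAULT_EQ_SELECTIVITY = 0.1; // assumed default heuristic

    static double equalitySelectivity(StatsSource source, String table, String column) {
        return source.getTableStats(table)
                .map(stats -> stats.distinctValues.get(column)) // empty if column unknown
                .filter(distinct -> distinct > 0)
                .map(distinct -> 1.0 / distinct)
                .orElse(DEFAULT_EQ_SELECTIVITY);
    }
}
```

With statistics recording 4 distinct values for a column, `equalitySelectivity` returns 0.25 under the uniform-distribution assumption; for a table or column without statistics it returns the assumed default of 0.1, which is the imprecise fallback the abstract describes.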