IEEE International Conference on Data Engineering

Estimating temporary file sizes in distributed relational database systems


Abstract

The estimated sizes of temporary files are among the most important statistics an optimizer uses to generate a minimum-cost processing strategy. Statistical information is the primary input to the estimation technique. The amount of statistics kept about the key attributes and about the distribution of data values within a domain in a relation greatly affects the accuracy of the estimates. However, the cost of storing this information may outweigh its value. Ultimately, that determination is left to those responsible for designing and implementing distributed relational DBMSs. This paper presents a new method for calculating temporary file sizes. It also describes specific data structures and algorithms to implement the proposed method. The major tools suggested are a Log File and several specialized data matrices. The latter contain information unique to each relation. The Log File is a posting file that records the number of occurrences of different values in the database.
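The abstract does not give the layout of the Log File or the data matrices, so the following is only a minimal sketch of the general idea of a posting structure that counts value occurrences and of how such counts could feed a size estimate. The names `ValueLog`, `estimate_selection_bytes`, and `estimate_equijoin_tuples`, the per-attribute keying, and the fixed `tuple_bytes` parameter are all assumptions made for illustration, not the paper's design.

```python
from collections import Counter


class ValueLog:
    """Hypothetical posting structure: for each (relation, attribute) pair,
    keep a count of how often each value occurs."""

    def __init__(self):
        self._counts = {}  # (relation, attribute) -> Counter of values

    def record(self, relation, attribute, value):
        """Register one occurrence of `value` in relation.attribute."""
        key = (relation, attribute)
        self._counts.setdefault(key, Counter())[value] += 1

    def counts(self, relation, attribute):
        """Return the value counts for relation.attribute (empty if unknown)."""
        return self._counts.get((relation, attribute), Counter())


def estimate_selection_bytes(log, relation, attribute, value, tuple_bytes):
    """Size of the temporary file produced by `attribute = value`,
    assuming every tuple occupies `tuple_bytes` bytes (an assumption)."""
    return log.counts(relation, attribute)[value] * tuple_bytes


def estimate_equijoin_tuples(log, rel_r, attr_r, rel_s, attr_s):
    """Tuple count of the equi-join R.attr_r = S.attr_s: each shared value
    contributes (occurrences in R) * (occurrences in S) result tuples."""
    r_counts = log.counts(rel_r, attr_r)
    s_counts = log.counts(rel_s, attr_s)
    return sum(n * s_counts[v] for v, n in r_counts.items())


# Toy data, invented purely for this example.
log = ValueLog()
for dept in ["A", "A", "B"]:
    log.record("EMP", "dept", dept)
for name in ["A", "B", "B", "C"]:
    log.record("DEPT", "name", name)

selection_bytes = estimate_selection_bytes(log, "EMP", "dept", "A", tuple_bytes=40)
join_tuples = estimate_equijoin_tuples(log, "EMP", "dept", "DEPT", "name")
```

With exact per-value counts these figures are exact rather than estimated. The storage trade-off the abstract mentions appears when the counts are kept only for some attributes or are summarized, which this sketch does not model.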
