首页> 外文会议>International Joint Conference on Rough Sets >A Metadata Diagnostic Framework for a New Approximate Query Engine Working with Granulated Data Summaries
【24h】

A Metadata Diagnostic Framework for a New Approximate Query Engine Working with Granulated Data Summaries

机译:新近似查询引擎的元数据诊断框架与颗粒数据摘要一起使用

获取原文

摘要

This paper refers to a new database engine that acquires and utilizes granulated data summaries for the purposes of fast approximate execution of analytical SQL statements. We focus on the task of creation of a relational metadata repository which enables the engine developers and users to investigate the collected data summaries independently from the engine itself. We discuss how the design of the considered repository evolved over time from both conceptual and software engineering perspectives, addressing the challenges of conversion and accessibility of the internal engine contents that can represent hundreds of terabytes of the original data. We show some scenarios of a usage of the obtained metadata repository for both diagnostic and analytical purposes. We pay a particular attention to the relationships of the discussed scenarios with the principles of rough sets - one of the theories that hugely influenced the presented solutions. We also report some empirical results obtained for relatively small fragments (100 × 2~(16) rows each) of data sets coming from two organizations that use the considered new engine.
机译:本文是指新的数据库引擎,用于获取并利用颗粒状数据摘要,以便快速近似地执行分析SQL语句。我们专注于创建关系元数据存储库的任务,这使得发动机开发人员和用户能够独立于发动机本身调查收集的数据摘要。我们讨论所考虑的存储库的设计如何随着时间的推移而发展,从而解决了可以代表原始数据数百个TB的内部发动机内容的转换和可访问性的挑战。对于诊断和分析目的,我们展示了所获得的元数据存储库的使用情况。我们特别关注讨论的方案与粗糙集原则的关系 - 一种巨大地影响所提出的解决方案的理论之一。我们还报告了来自两个组织的相对较小的碎片(100×2〜(16)行)的一些经验结果来自两个使用所考虑的新引擎的组织。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号