Conference: ACM SIGMOD international conference on Management of data

Efficient computation of multiple group by queries

Abstract

Data analysts need to understand the quality of data in the warehouse. This is often done by issuing many Group By queries on the sets of columns of interest. Since the volume of data in these warehouses can be large, and tables in a data warehouse often contain many columns, this analysis typically requires executing a large number of Group By queries, which can be expensive. We show that the performance of today's database systems for such data analysis is inadequate. We also show that the problem is computationally hard, and develop efficient techniques for solving it. We demonstrate significant speedup over existing approaches on today's commercial database systems.
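
The abstract does not say which techniques the authors develop, so the following is only an illustrative sketch of why a batch of Group By queries can be cheaper to run together than one at a time. It assumes a toy in-memory table (the rows, the column names, and the helper functions `group_count` and `rollup` are all hypothetical) and shows one generic idea: compute a finer grouping once, then derive coarser COUNT(*) groupings from it instead of rescanning the base data.

```python
from collections import Counter

# Hypothetical toy table; rows and column names are made up for illustration.
ROWS = [
    {"country": "US", "state": "WA", "city": "Seattle"},
    {"country": "US", "state": "WA", "city": "Redmond"},
    {"country": "US", "state": "CA", "city": "Fresno"},
    {"country": "CA", "state": "BC", "city": "Vancouver"},
]


def group_count(rows, columns):
    """COUNT(*) per distinct value combination of `columns`, using one scan of `rows`."""
    return Counter(tuple(row[c] for c in columns) for row in rows)


def rollup(counts, columns, keep):
    """Derive a coarser COUNT(*) grouping from a finer one without touching the base rows."""
    positions = [columns.index(c) for c in keep]
    coarse = Counter()
    for key, n in counts.items():
        coarse[tuple(key[i] for i in positions)] += n
    return coarse


# Naive plan: every Group By query scans the base table again.
naive = {cols: group_count(ROWS, cols) for cols in [("country",), ("state",)]}

# Shared plan: scan once for (country, state), then answer both queries
# from that smaller intermediate result.
fine_columns = ("country", "state")
fine = group_count(ROWS, fine_columns)
shared = {
    ("country",): rollup(fine, fine_columns, ("country",)),
    ("state",): rollup(fine, fine_columns, ("state",)),
}
```

Both plans return the same counts. The shared plan reads the base table once, which matters when the table is large and the intermediate grouping is much smaller than the table. Choosing which intermediate groupings to compute for a given set of queries is a separate question that this sketch does not address.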
