Query sampling in DB2 Universal Database

Abstract

Executing ad hoc queries against large databases can be prohibitively expensive. Exploratory analysis of data, however, may not require exact answers to queries: results based on sampling the data are often satisfactory. Supporting sampling as a primitive SQL operator turns out to be difficult because sampling does not commute with many SQL operators. In this paper, we describe an implementation in IBM DB2 Universal Database (UDB) of a sampling operator that commutes with some SQL operators. As a result, a query with the sampling operator always returns a random sample of the answers and in many cases runs faster than it would have without such an operator.
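
To make the commutation issue concrete, the following is a minimal toy sketch in Python, not the DB2 implementation described in the paper. It assumes row-level Bernoulli sampling (each row kept independently with probability p); the helper names `make_sampler`, `select`, and the sample table are hypothetical and chosen only for illustration. Under that assumption, sampling commutes with a selection predicate, but not with DISTINCT: a value that occurs in many duplicate rows is far more likely to survive when sampling runs before duplicate elimination.

```python
import random


def make_sampler(rows, p, seed=0):
    """Fix one keep/drop decision per row id (Bernoulli sampling with rate p).

    Fixing the decisions up front lets us apply the *same* sample before or
    after another operator and compare the results exactly.
    """
    rng = random.Random(seed)
    keep = {row["id"]: rng.random() < p for row in rows}
    return lambda subset: [r for r in subset if keep[r["id"]]]


def select(rows, pred):
    """Relational selection: keep the rows that satisfy the predicate."""
    return [r for r in rows if pred(r)]


# Hypothetical table: 90 rows for city "A", 10 rows for city "B".
rows = [{"id": i, "city": "A" if i < 90 else "B"} for i in range(100)]
p = 0.1
sample = make_sampler(rows, p)
is_even = lambda r: r["id"] % 2 == 0

# Selection: the order of sampling and filtering does not matter.
filter_then_sample = sample(select(rows, is_even))
sample_then_filter = select(sample(rows), is_even)
commutes_with_selection = filter_then_sample == sample_then_filter

# DISTINCT: the probability that city "A" appears in the result depends on
# the order. After DISTINCT there is one "A" row, kept with probability p.
# Before DISTINCT there are 90 "A" rows, and any one surviving is enough.
p_distinct_then_sample = p
p_sample_then_distinct = 1 - (1 - p) ** 90

print("commutes with selection:", commutes_with_selection)
print("P(A present), DISTINCT then sample:", p_distinct_then_sample)
print("P(A present), sample then DISTINCT:", round(p_sample_then_distinct, 5))
```

Pushing the sample below the selection is what makes a sampled query cheaper, since fewer rows reach the later operators; the DISTINCT case shows why the same rewrite cannot be applied blindly to every operator.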