首页> 外国专利> MULTI-DIMENSIONAL DATABASE AND DATA CUBE COMPRESSION FOR AGGREGATE QUERY SUPPORT ON NUMERIC DIMENSIONS

MULTI-DIMENSIONAL DATABASE AND DATA CUBE COMPRESSION FOR AGGREGATE QUERY SUPPORT ON NUMERIC DIMENSIONS

机译:多维维度上的汇总查询支持的多维数据库和数据多维数据集压缩

摘要

An apparatus and method for efficiently compressing contents of a database system to support ad hoc querying and OLAP type aggregation queries. This invention consists of a new compressed representation of the data cube that (a) drastically reduces storage requirements, (b) does not require the discretization hierarchy along each query dimension to be fixed beforehand and (c) treats each dimension as a potential target measure and supports multiple aggregation functions without additional storage costs. The tradeoff is approximate, yet relatively accurate, answers to queries. The basic method relies on representing the contents of the database by a probability distribution consisting of a mixture of Gaussians. Aggregation queries, be they multi-dimensional, conjunctive, or disjunctive, can be answered by performing integration over the probability distribution. We augment the basic model with a collection of (possibly compressed) outliers rows from the data to further enhance accuracy if more system memory is available for this task.
机译:一种用于有效压缩数据库系统内容以支持即席查询和OLAP类型聚合查询的装置和方法。本发明由数据多维数据集的新压缩表示组成,它(a)大大减少了存储需求,(b)不需要预先固定每个查询维度上的离散化层次,并且(c)将每个维度视为潜在的目标度量并支持多种聚合功能,而无需额外的存储成本。权衡是近似的,但相对准确,是对查询的回答。基本方法依赖于由高斯混合构成的概率分布来表示数据库的内容。聚合查询,无论是多维查询,合取查询还是析取查询,都可以通过对概率分布进行积分来回答。如果有更多系统内存可用于此任务,我们将通过从数据中收集(可能是压缩的)异常行来增强基本模型,从而进一步提高准确性。

著录项

  • 公开/公告号WO0065479A1

    专利类型

  • 公开/公告日2000-11-02

    原文格式PDF

  • 申请/专利权人 MICROSOFT CORPORATION;

    申请/专利号WO2000US10471

  • 发明设计人 SHANMUGASUNDARAM JAYAVEL;FAYYAD USAMA;

    申请日2000-04-19

  • 分类号G06F17/30;

  • 国家 WO

  • 入库时间 2022-08-22 01:49:16

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号