首页> 外国专利> Method and apparatus for learning probabilistic relational models having attribute and link uncertainty and for performing selectivity estimation using probabilistic relational models

Method and apparatus for learning probabilistic relational models having attribute and link uncertainty and for performing selectivity estimation using probabilistic relational models

机译：用于学习具有属性和链接不确定性的概率关系模型并使用概率关系模型执行选择性估计的方法和装置

页面导航

摘要
著录项
相似文献

摘要

The invention comprises a method and apparatus for learning probabilistic models (PRM's) with attribute uncertainty. A PRM with attribute uncertainty defines a probability distribution over instantiations of a database. A learned PRM is useful for discovering interesting patterns and dependencies in the data. Unlike many existing techniques, the process is data-driven rather than hypothesis driven. This makes the technique particularly well-suited for exploratory data analysis. In addition, the invention comprises a method and apparatus for handling link uncertainty in PRM's. Link uncertainty is uncertainty over which entities are related in our domain. The invention comprises of two mechanisms for modeling link uncertainty: reference uncertainty and existence uncertainty. The invention includes learning algorithms for each form of link uncertainty. The third component of the invention is a technique for performing database selectivity estimation using probabilistic relational models. The invention provides a unified framework for the estimation of query result size for a broad class of queries involving both select and join operations. A single learned model can be used to efficiently estimate query result sizes for a wide collection of potential queries across multiple tables.

机译：本发明包括用于学习具有属性不确定性的概率模型（PRM）的方法和设备。具有属性不确定性的PRM定义了数据库实例上的概率分布。学习过的PRM对于发现数据中有趣的模式和相关性很有用。与许多现有技术不同，该过程是数据驱动的，而不是假设驱动的。这使得该技术特别适合于探索性数据分析。另外，本发明包括用于处理PRM中的链路不确定性的方法和设备。链接不确定性是我们领域中哪些实体相关的不确定性。本发明包括用于对链路不确定性建模的两种机制：参考不确定性和存在不确定性。本发明包括用于每种形式的链路不确定性的学习算法。本发明的第三部分是用于使用概率关系模型执行数据库选择性估计的技术。本发明提供了用于估计涉及选择和联接操作的广泛查询类别的查询结果大小的统一框架。单个学习模型可用于有效地估计查询结果大小，以跨多个表收集大量潜在查询。

著录项

公开/公告号US2002103793A1

专利类型
公开/公告日2002-08-01

原文格式PDF
申请/专利权人 KOLLER DAPHNE;GETOOR LISE;PFEFFER AVI;FRIEDMAN NIR;TASKAR BEN;
展开▼

申请/专利号US20010922324
发明设计人 DAPHNE KOLLER;LISE GETOOR;AVI PFEFFER;NIR FRIEDMAN;BEN TASKAR;
展开▼

申请日2001-08-02
分类号G06F7/00;
国家 US
入库时间 2022-08-22 00:50:09

相似文献

专利
外文文献
中文文献