Increasing Representational Power and Scaling Reasoning in Probabilistic Databases

机译：概率数据库中表示能力的提高和定标推理

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Increasing numbers of real-world application domains are generating data that is inherently noisy, incomplete, and probabilistic in nature. Statistical analysis and probabilistic inference, widely used in those domains, often introduce additional layers of uncertainty. Examples include sensor data analysis, data integration and information extraction on the Web, social network analysis, and scientific and biomedical data management. Managing and querying such data requires us to combine the tools and the techniques from a variety of disciplines including databases, first-order logic, and probabilistic reasoning. There has been much work at the intersection of these research areas in recent years. The work on probabilistic databases has made great advances in efficiently executing SQL and inference queries over large-scale uncertain datasets [2,1]. The research in first-order probabilistic models like probabilistic relational models [5], Markov logic networks [10] etc. (see Getoor and Taskar [6] for a comprehensive overview), and the work on lifted inference [9, 3, 8, 11] has resulted in several techniques for efficiently integrating first-order logic and probabilistic reasoning. In this talk, I will present some of the foundations of large-scale probabilistic data management, and the challenges in scaling the representational power and the reasoning capabilities of probabilistic databases. I will use the PrDB probabilistic data management system being developed at the University of Maryland as a case study for this purpose [4, 7, 12]. Unlike the other recent work on probabilistic databases, PrDB is designed to represent uncertain data with rich correlation structures, and it uses probabilistic graphical models as the basic representation model. I will discuss how PrDB supports compact specification of uncertainties at different abstraction levels, from "schema-level" uncertainties that apply to entire relations to "tuple-specific" uncertainties that apply to a specific tuple or a specific set of tuples; I will also discuss how this relates to the work on first-order probabilistic models. Query evaluation in PrDB can be formulated as inference in appropriately constructed graphical models, and I will briefly present some of the key novel techniques that we have developed for efficient query evaluation, and their relationship to recent work on efficient lifted inference. I will conclude with a discussion of some of the open research challenges moving forward.

机译：越来越多的实际应用程序域正在生成本质上具有固有噪声，不完整和概率的数据。在这些领域中广泛使用的统计分析和概率推论经常引入更多的不确定性层。示例包括传感器数据分析，Web上的数据集成和信息提取，社交网络分析以及科学和生物医学数据管理。管理和查询此类数据要求我们结合各种学科的工具和技术，包括数据库，一阶逻辑和概率推理。近年来，在这些研究领域的交汇处开展了大量工作。概率数据库的工作在有效地对大型不确定数据集执行SQL和推理查询方面取得了长足的进步[2,1]。对一阶概率模型的研究，例如概率关系模型[5]，马尔可夫逻辑网络[10]等（有关全面概述，请参见Getoor和Taskar [6]），以及有关提升推理的工作[9、3、8] ，[11]提出了几种有效整合一阶逻辑和概率推理的技术。在本演讲中，我将介绍大规模概率数据管理的一些基础，以及在扩展概率数据库的表示能力和推理能力方面的挑战。为此，我将使用马里兰大学正在开发的PrDB概率数据管理系统作为案例研究[4，7，12]。与最近关于概率数据库的其他工作不同，PrDB旨在表示具有丰富相关性结构的不确定数据，并且它使用概率图形模型作为基本表示模型。我将讨论PrDB如何支持不同抽象级别上不确定性的紧凑规范，从适用于整个关系的“模式级”不确定性到适用于特定元组或特定元组集合的“元组特定”不确定性；我还将讨论这与一阶概率模型的工作之间的关系。可以将PrDB中的查询评估公式化为在适当构建的图形模型中的推理，我将简要介绍一些我们为有效查询评估而开发的关键新颖技术，以及它们与近期有关有效提升推理的工作之间的关系。最后，我将讨论一些前进的开放研究挑战。

著录项

来源
《13th international conference on database theory 2010》|2010年|p.1|共1页
会议地点 Lausanne(CH);Lausanne(CH)
作者
Amol Deshpande;
展开▼
作者单位

Department of Computer Science and UMIACS University of Maryland, College Park, MD, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词
入库时间 2022-08-26 13:59:20

相似文献

外文文献
中文文献
专利

1. Repairing errors in probabilistic databases models using probabilistic abduction reasoning [J] . Bourahla Mustapha International journal of intelligent information and database systems . 2017,第1a2期

机译：使用概率绑架推理修复概率数据库模型中的错误
2. Frontiers for propositional reasoning about fragments of probabilistic conditional independence and hierarchical database decompositions [J] . Link Sebastian Theoretical computer science . 2015,第Null期

机译：关于概率条件独立性片段和分层数据库分解的命题推理前沿
3. Measuring Probabilistic Reasoning: The Construction of a New Scale Applying Item Response Theory [J] . Primi Caterina, Morsanyi Kinga, Donati Maria Anna, Journal of Behavioral Decision Making . 2017,第4期

机译：测量概率推理：运用项目反应理论构建新的量表
4. Increasing Representational Power and Scaling Reasoning in Probabilistic Databases [C] . Amol Deshpande International conference on database theory . 2010

机译：提高概率数据库中的代表性力量和扩展推理
5. Probabilistic reasoning utilizing relational database techniques. [D] . Fang, Fang. 2005

机译：利用关系数据库技术的概率推理。
6. Explicit Tracking of Uncertainty Increases the Power of Quantitative Rule-of-Thumb Reasoning in Cell Biology [O] . Iain G. Johnston, Benjamin C. Rickett, Nick S. Jones 2014

机译：明确的不确定性跟踪提高了细胞生物学定量经验法则推理的能力
7. Increasing Representational Power and Scaling Reasoning in Probabilistic Databases [O] . Amol Deshpande 2014

机译：概率数据库中表示能力的提高和定标推理
8. Generalized Probabilistic Reasoning and Empirical Studies on ComputationalEfficiency and Scalability [R] . Baenen, E. P. 1994

机译：广义概率推理与计算效率和可扩展性的实证研究

Increasing Representational Power and Scaling Reasoning in Probabilistic Databases

摘要

著录项

相似文献

相关主题

期刊订阅