首页> 外文会议>13th international conference on database theory 2010 >Increasing Representational Power and Scaling Reasoning in Probabilistic Databases
【24h】

Increasing Representational Power and Scaling Reasoning in Probabilistic Databases

机译:概率数据库中表示能力的提高和定标推理

获取原文
获取原文并翻译 | 示例

摘要

Increasing numbers of real-world application domains are generating data that is inherently noisy, incomplete, and probabilistic in nature. Statistical analysis and probabilistic inference, widely used in those domains, often introduce additional layers of uncertainty. Examples include sensor data analysis, data integration and information extraction on the Web, social network analysis, and scientific and biomedical data management. Managing and querying such data requires us to combine the tools and the techniques from a variety of disciplines including databases, first-order logic, and probabilistic reasoning. There has been much work at the intersection of these research areas in recent years. The work on probabilistic databases has made great advances in efficiently executing SQL and inference queries over large-scale uncertain datasets [2,1]. The research in first-order probabilistic models like probabilistic relational models [5], Markov logic networks [10] etc. (see Getoor and Taskar [6] for a comprehensive overview), and the work on lifted inference [9, 3, 8, 11] has resulted in several techniques for efficiently integrating first-order logic and probabilistic reasoning. In this talk, I will present some of the foundations of large-scale probabilistic data management, and the challenges in scaling the representational power and the reasoning capabilities of probabilistic databases. I will use the PrDB probabilistic data management system being developed at the University of Maryland as a case study for this purpose [4, 7, 12]. Unlike the other recent work on probabilistic databases, PrDB is designed to represent uncertain data with rich correlation structures, and it uses probabilistic graphical models as the basic representation model. I will discuss how PrDB supports compact specification of uncertainties at different abstraction levels, from "schema-level" uncertainties that apply to entire relations to "tuple-specific" uncertainties that apply to a specific tuple or a specific set of tuples; I will also discuss how this relates to the work on first-order probabilistic models. Query evaluation in PrDB can be formulated as inference in appropriately constructed graphical models, and I will briefly present some of the key novel techniques that we have developed for efficient query evaluation, and their relationship to recent work on efficient lifted inference. I will conclude with a discussion of some of the open research challenges moving forward.
机译:越来越多的实际应用程序域正在生成本质上具有固有噪声,不完整和概率的数据。在这些领域中广泛使用的统计分析和概率推论经常引入更多的不确定性层。示例包括传感器数据分析,Web上的数据集成和信息提取,社交网络分析以及科学和生物医学数据管理。管理和查询此类数据要求我们结合各种学科的工具和技术,包括数据库,一阶逻辑和概率推理。近年来,在这些研究领域的交汇处开展了大量工作。概率数据库的工作在有效地对大型不确定数据集执行SQL和推理查询方面取得了长足的进步[2,1]。对一阶概率模型的研究,例如概率关系模型[5],马尔可夫逻辑网络[10]等(有关全面概述,请参见Getoor和Taskar [6]),以及有关提升推理的工作[9、3、8] ,[11]提出了几种有效整合一阶逻辑和概率推理的技术。在本演讲中,我将介绍大规模概率数据管理的一些基础,以及在扩展概率数据库的表示能力和推理能力方面的挑战。为此,我将使用马里兰大学正在开发的PrDB概率数据管理系统作为案例研究[4,7,12]。与最近关于概率数据库的其他工作不同,PrDB旨在表示具有丰富相关性结构的不确定数据,并且它使用概率图形模型作为基本表示模型。我将讨论PrDB如何支持不同抽象级别上不确定性的紧凑规范,从适用于整个关系的“模式级”不确定性到适用于特定元组或特定元组集合的“元组特定”不确定性;我还将讨论这与一阶概率模型的工作之间的关系。可以将PrDB中的查询评估公式化为在适当构建的图形模型中的推理,我将简要介绍一些我们为有效查询评估而开发的关键新颖技术,以及它们与近期有关有效提升推理的工作之间的关系。最后,我将讨论一些前进的开放研究挑战。

著录项

  • 来源
  • 会议地点 Lausanne(CH);Lausanne(CH)
  • 作者

    Amol Deshpande;

  • 作者单位

    Department of Computer Science and UMIACS University of Maryland, College Park, MD, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 TP311.13;
  • 关键词

  • 入库时间 2022-08-26 13:59:20

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号