首页> 外文期刊>Data & Knowledge Engineering >Probabilistic object deputy model for uncertain data and lineage management
【24h】

Probabilistic object deputy model for uncertain data and lineage management

机译:不确定数据和谱系管理的概率对象代理模型

获取原文
获取原文并翻译 | 示例

摘要

Lineage is important in uncertain data management since it can be used for finding out which part of data contributes to a result and computing the probability of the result. Nonetheless, the existing works consider an uncertain tuple as a set of tuples that can be stored in a relational table. Lineage can derive each tuple in the table, with which one can only find out the tuples rather than specific attributes that contribute to the result. If uncertain tuples have multiple uncertain attributes, for a result tuple with low probability, users cannot know which attribute is the main cause of it. In this paper, we propose an approach to model uncertain data. Compared with the alternative way based on the relational model, our model achieves a low maintenance cost and avoids a large number of redundant storage and join operations. Based on our model, some operations are defined for querying data, generating lineage, computing probability and derivation of results. Further, we discuss how to correctly compute probability with lineage and an algorithm is proposed to transform lineage for correct probability computation. We also discuss how to realize result derivation with the lineage. Experiments show the advantages of the proposed model on uncertain data management.
机译:沿袭在不确定的数据管理中很重要,因为它可以用于找出数据的哪一部分有助于结果并计算结果的可能性。但是,现有工作将不确定的元组视为可以存储在关系表中的一组元组。沿袭可以导出表中的每个元组,使用它们只能找到元组,而不能找出对结果有贡献的特定属性。如果不确定元组具有多个不确定属性,那么对于结果概率较低的元组,用户将无法知道哪个属性是其主要原因。在本文中,我们提出了一种对不确定数据进行建模的方法。与基于关系模型的替代方法相比,我们的模型维护成本较低,并且避免了大量的冗余存储和联接操作。根据我们的模型,定义了一些用于查询数据,生成谱系,计算概率和结果推导的操作。此外,我们讨论了如何使用谱系正确地计算概率,并提出了一种转换谱系的算法以进行正确的概率计算。我们还将讨论如何实现沿袭的结果推导。实验证明了该模型在不确定数据管理中的优势。

著录项

  • 来源
    《Data & Knowledge Engineering》 |2017年第5期|70-84|共15页
  • 作者单位

    State Key Lab Software Engn, Wuhan, Peoples R China|Wuhan Univ, Comp Sch, Wuhan, Peoples R China|Huazhong Univ Sci & Technol, Tongji Med Coll, Ctr Comp, Tongji Hosp, Wuhan, Peoples R China;

    Wuhan Univ, Int Sch Software, Wuhan, Peoples R China;

    State Key Lab Software Engn, Wuhan, Peoples R China|Wuhan Univ, Comp Sch, Wuhan, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Uncertain data; Data modeling; Lineage; Probability computation;

    机译:不确定数据;数据建模;沿袭;概率计算;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号