首页> 外文会议>Medical Informatics Europe Conference >Probabilistic Approaches to Overcome Content Heterogeneity in Data Integration: A Study Case in Systematic Lupus Erythematosus
【24h】

Probabilistic Approaches to Overcome Content Heterogeneity in Data Integration: A Study Case in Systematic Lupus Erythematosus

机译:克服数据集成中内容异质性的概率方法:系统狼疮红斑的研究案例

获取原文
获取外文期刊封面目录资料

摘要

Integrating data from different sources into homogeneous dataset increases the opportunities to study human health. However, disparate data collections are often heterogeneous, which complicates their integration. In this paper, we focus on the issue of content heterogeneity in data integration. Traditional approaches for resolving content heterogeneity map all source datasets to a common data model that includes only shared data items, and thus omit all items that vary between datasets. Based on an example of three datasets in Systemic Lupus Erythematosus, we describe and experimentally evaluate a probabilistic data integration approach which propagates the uncertainty resulting from content heterogeneity into statistical inference, avoiding the need to map to a common data model.
机译:将来自不同来源的数据集成到同质数据集中增加了研究人类健康的机会。 但是,不同的数据收集通常是异构的,这使他们的集成复杂化。 在本文中,我们专注于数据集成中内容异质性问题。 将内容异质性的传统方法将所有源数据集映射到仅包含共享数据项的公共数据模型,从而省略数据集之间各种各种的项目。 基于Systemic Lupus和Erythematosus中的三个数据集的示例,我们描述并通过实验评估了一种概率数据集成方法,该方法传播由内容异质性导致的不确定性转化为统计推断,避免了需要映射到公共数据模型的需要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号