首页> 外文期刊>International Journal of Epidemiology: Official Journal of the International Epidemiological Association >DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.
【24h】

DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.

机译:DataShield:解决当代生物科学中的冲突 - 在不共享数据的情况下执行单个数据的汇总分析。

获取原文
获取原文并翻译 | 示例
       

摘要

BACKGROUND: Contemporary bioscience sometimes demands vast sample sizes and there is often then no choice but to synthesize data across several studies and to undertake an appropriate pooled analysis. This same need is also faced in health-services and socio-economic research. When a pooled analysis is required, analytic efficiency and flexibility are often best served by combining the individual-level data from all sources and analysing them as a single large data set. But ethico-legal constraints, including the wording of consent forms and privacy legislation, often prohibit or discourage the sharing of individual-level data, particularly across national or other jurisdictional boundaries. This leads to a fundamental conflict in competing public goods: individual-level analysis is desirable from a scientific perspective, but is prevented by ethico-legal considerations that are entirely valid. METHODS: Data aggregation through anonymous summary-statistics from harmonized individual-level databases (DataSHIELD), provides a simple approach to analysing pooled data that circumvents this conflict. This is achieved via parallelized analysis and modern distributed computing and, in one key setting, takes advantage of the properties of the updating algorithm for generalized linear models (GLMs). RESULTS: The conceptual use of DataSHIELD is illustrated in two different settings. CONCLUSIONS: As the study of the aetiological architecture of chronic diseases advances to encompass more complex causal pathways-e.g. to include the joint effects of genes, lifestyle and environment-sample size requirements will increase further and the analysis of pooled individual-level data will become ever more important. An aim of this conceptual article is to encourage others to address the challenges and opportunities that DataSHIELD presents, and to explore potential extensions, for example to its use when different data sources hold different data on the same individuals.
机译:背景:当代生物科学有时需要巨大的样本尺寸,并且通常别无选择,只能在几项研究中综合数据并进行适当的汇总分析。同样的需求也面临着健康服务和社会经济研究。当需要汇总分析时,通过将各个来源的各个数据组合并将其分析为单个大数据集,通常最好地提供分析效率和灵活性。但是,伦理法律限制,包括同意形式和隐私立法的措辞,通常禁止或阻止个人级别数据的分享,特别是在国家或其他司法管辖区。这导致竞争公共产品的基本冲突:从科学的角度来看,个人级别分析是可取的,但被完全有效的伦理法律考虑因素被预防。方法:通过统一单个数据库(DataShield)的匿名摘要统计数据聚合,提供了一种简单的方法来分析循环这种冲突的池数据。这是通过并行化分析和现代分布式计算实现的,并且在一个关键设置中,利用了广义线性模型(GLM)更新算法的性质。结果:DataShield的概念使用在两个不同的设置中说明。结论:作为慢性疾病的安全性建筑的研究进步涵盖更复杂的因果途径-e.g。为了包括基因的联合效果,生活方式和环境 - 样本量要求将进一步增加,并且汇集的个性级数据的分析将变得更加重要。这一概念文章的目的是鼓励他人解决数据列表所呈现的挑战和机会,并探索潜在的扩展,例如,当不同的数据源在同一个人上保存不同的数据时使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号