Journal: International Journal of Epidemiology: Official Journal of the International Epidemiological Association

DataSHIELD: resolving a conflict in contemporary bioscience--performing a pooled analysis of individual-level data without sharing the data.

Abstract

BACKGROUND: Contemporary bioscience sometimes demands vast sample sizes, and there is often no choice but to synthesize data across several studies and undertake an appropriate pooled analysis. The same need arises in health-services and socio-economic research. When a pooled analysis is required, analytic efficiency and flexibility are often best served by combining the individual-level data from all sources and analysing them as a single large data set. But ethico-legal constraints, including the wording of consent forms and privacy legislation, often prohibit or discourage the sharing of individual-level data, particularly across national or other jurisdictional boundaries. This leads to a fundamental conflict between competing public goods: individual-level analysis is desirable from a scientific perspective, but is prevented by ethico-legal considerations that are entirely valid.

METHODS: Data aggregation through anonymous summary-statistics from harmonized individual-level databases (DataSHIELD) provides a simple approach to analysing pooled data that circumvents this conflict. This is achieved via parallelized analysis and modern distributed computing and, in one key setting, takes advantage of the properties of the updating algorithm for generalized linear models (GLMs).

RESULTS: The conceptual use of DataSHIELD is illustrated in two different settings.

CONCLUSIONS: As the study of the aetiological architecture of chronic diseases advances to encompass more complex causal pathways (e.g. to include the joint effects of genes, lifestyle and environment), sample size requirements will increase further and the analysis of pooled individual-level data will become ever more important. An aim of this conceptual article is to encourage others to address the challenges and opportunities that DataSHIELD presents, and to explore potential extensions, for example to its use when different data sources hold different data on the same individuals.
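The abstract does not spell out the GLM updating step, so the following is only a minimal sketch of the general idea, assuming a logistic regression fitted by Newton-Raphson (equivalently, iteratively reweighted least squares). Each site computes the score vector and information matrix for the current coefficients from its own records and returns only those aggregates; because both quantities are sums over individuals, adding them across sites gives the same update as a fit on the combined data. The function names (`site_summaries`, `solve`, `fit_pooled`) and the toy data are hypothetical and are not part of any DataSHIELD software.

```python
import math


def site_summaries(X, y, beta):
    """Runs inside one study: returns aggregate score and information only."""
    p = len(beta)
    score = [0.0] * p
    info = [[0.0] * p for _ in range(p)]
    for xi, yi in zip(X, y):
        eta = sum(b * x for b, x in zip(beta, xi))
        mu = 1.0 / (1.0 + math.exp(-eta))  # logistic mean
        w = mu * (1.0 - mu)                # GLM working weight
        for j in range(p):
            score[j] += xi[j] * (yi - mu)
            for k in range(p):
                info[j][k] += w * xi[j] * xi[k]
    return score, info


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def fit_pooled(sites, p, iterations=25, tol=1e-10):
    """Coordinator: sums per-site summaries and updates the coefficients."""
    beta = [0.0] * p
    for _ in range(iterations):
        total_score = [0.0] * p
        total_info = [[0.0] * p for _ in range(p)]
        for X, y in sites:
            s, i = site_summaries(X, y, beta)  # only aggregates leave a site
            for j in range(p):
                total_score[j] += s[j]
                for k in range(p):
                    total_info[j][k] += i[j][k]
        step = solve(total_info, total_score)
        beta = [b + d for b, d in zip(beta, step)]
        if max(abs(d) for d in step) < tol:
            break
    return beta


# Made-up toy data: columns are (intercept, exposure); y is a binary outcome.
site_a = ([[1, 0], [1, 1], [1, 2], [1, 3]], [0, 1, 0, 1])
site_b = ([[1, 0], [1, 1], [1, 2], [1, 4]], [0, 0, 1, 1])

pooled_beta = fit_pooled([site_a, site_b], p=2)
print(pooled_beta)
```

In this sketch, calling `fit_pooled` on the two sites separately and calling it on a single "site" holding all eight records should give the same coefficients up to floating-point rounding, which is the property the METHODS paragraph relies on. A real deployment would also need safeguards the sketch omits, such as checks that the returned summaries are not disclosive for very small sites.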