...
首页> 外文期刊>BMC Medical Research Methodology >Addressing data privacy in matched studies via virtual pooling
【24h】

Addressing data privacy in matched studies via virtual pooling

机译:通过虚拟池解决匹配研究中的数据隐私

获取原文
           

摘要

Background Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. Methods We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. Results The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Conclusions Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.
机译:背景技术在使用多中心研究和分布式数据进行的研究中,数据的保密性和研究数据的共享使用是两个理想的目标,但有时是相互矛盾的。虽然非常适合直接分析,但是机密性限制禁止创建包含所有参与者的协变量信息的单个数据集。当前的方法,例如汇总数据共享,分布式回归,荟萃分析和基于评分的方法,可能会有重要的局限性。方法我们建议对现有的流行病学工具样本库进行新颖的应用,以使对病例对照,多中心设计相匹配的数据进行保密保存分析。代替在分析前合并样本,我们将方法应用于节点内的虚拟合并(聚合)协变量。这种虚拟池保留了与单个数据一起用于分析的大多数信息,并且由于单个参与者数据未在外部共享,因此节点内虚拟池可保持数据的机密性。我们表明,可以在条件逻辑回归模型中使用汇总的协变量水平来估计感兴趣的个人水平比值比。结果将标准条件logistic回归的参数估计值与基于条件logistic回归模型的数据汇总的估计值进行比较。显示的参数估计值与没有合并的参数估计值相似,并且具有可比较的标准误差和置信区间覆盖范围。结论虚拟数据池可用于维护来自多中心研究的数据的机密性,在对大规模分布式数据的研究中尤其有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号