首页> 美国卫生研究院文献>American Journal of Human Genetics >Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach
【2h】

Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach

机译:在促进隐私的同时扩展对大规模基因组数据的访问:一种博弈论方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Emerging scientific endeavors are creating big data repositories of data from millions of individuals. Sharing data in a privacy-respecting manner could lead to important discoveries, but high-profile demonstrations show that links between de-identified genomic data and named persons can sometimes be reestablished. Such re-identification attacks have focused on worst-case scenarios and spurred the adoption of data-sharing practices that unnecessarily impede research. To mitigate concerns, organizations have traditionally relied upon legal deterrents, like data use agreements, and are considering suppressing or adding noise to genomic variants. In this report, we use a game theoretic lens to develop more effective, quantifiable protections for genomic data sharing. This is a fundamentally different approach because it accounts for adversarial behavior and capabilities and tailors protections to anticipated recipients with reasonable resources, not adversaries with unlimited means. We demonstrate this approach via a new public resource with genomic summary data from over 8,000 individuals—the Sequence and Phenotype Integration Exchange (SPHINX)—and show that risks can be balanced against utility more effectively than with traditional approaches. We further show the generalizability of this framework by applying it to other genomic data collection and sharing endeavors. Recognizing that such models are dependent on a variety of parameters, we perform extensive sensitivity analyses to show that our findings are robust to their fluctuations.
机译:新兴的科学工作正在创建数百万个人的大数据存储库。以尊重隐私的方式共享数据可能会导致重大发现,但备受瞩目的示范表明,有时可以重新建立身份不明的基因组数据与指定人员之间的联系。这种重新识别攻击主要针对最坏的情况,并刺激了采用不必要地阻碍研究的数据共享实践。为了减轻担忧,组织传统上依靠法律威慑力,例如数据使用协议,并且正在考虑抑制或增加基因组变异的噪音。在本报告中,我们使用博弈论的角度为基因组数据共享开发更有效,可量化的保护措施。这是一种根本不同的方法,因为它考虑了对抗行为和能力,并针对具有合理资源的预期接收者(而不是具有无限手段的对手)量身定制了保护措施。我们通过一个新的公共资源展示了这种方法,该资源具有来自8,000多个个体的基因组摘要数据-序列和表型整合交易所(SPHINX)-并显示与传统方法相比,可以更加有效地平衡风险和效用。通过将其应用于其他基因组数据收集和共享工作,我们进一步展示了该框架的可推广性。认识到此类模型取决于各种参数,我们进行了广泛的敏感性分析,以表明我们的发现对它们的波动具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号