首页> 外文会议>9th ACM workshop on digital rights management 2009 >Learning your identity and disease from research papers
【24h】

Learning your identity and disease from research papers

机译:从研究论文中了解您的身份和疾病

获取原文
获取原文并翻译 | 示例

摘要

Genome-wide association studies (GWAS) aim at discovering the association between genetic variations, particularly single-nucleotide polymorphism (SNP), and common diseases, which is well recognized to be one of the most important and active areas in biomedical research. Also renowned is the privacy implication of such studies, which has been brought into the limelight by the recent attack proposed by Homer et al. Homer's attack demonstrates that it is possible to identify a GWAS participant from the allele frequencies of a large number of SNPs. Such a threat, unfortunately, was found in our research to be significantly understated. In this paper, we show that individuals can actually be identified from even a relatively small set of statistics, as those routinely published in GWAS papers. We present two attacks. The first one extends Homer's attack with a much more powerful test statistic, based on the correlations among different SNPs described by coefficient of determination (r2). This attack can determine the presence of an individual from the statistics related to a couple of hundred SNPs. The second attack can lead to complete disclosure of hundreds of participants' SNPs, through analyzing the information derived from published statistics. We also found that those attacks can succeed even when the precisions of the statistics are low and part of data is missing. We evaluated our attacks on the real human genomes and concluded that such threats are completely realistic.
机译:全基因组关联研究(GWAS)旨在发现遗传变异(尤其是单核苷酸多态性(SNP))与常见疾病之间的关联,众所周知,常见疾病是生物医学研究中最重要,最活跃的领域之一。此类研究的隐私含义也广为人知,荷马等人最近提出的攻击使其受到关注。荷马的攻击表明,可以从大量SNP的等位基因频率中识别GWAS参与者。不幸的是,在我们的研究中发现这种威胁被大大低估了。在本文中,我们表明,实际上甚至可以从GWAS论文中常规发布的相对较小的一组统计信息中识别出个人。我们提出两次攻击。第一个基于确定系数(r2)描述的不同SNP之间的相关性,以更强大的测试统计量扩展了荷马的攻击。这种攻击可以根据与数百个SNP相关的统计信息确定个人的存在。通过分析从已发布的统计信息中得出的信息,第二次攻击可能导致数百名参与者的SNP完全泄露。我们还发现,即使统计数据的精度较低且部分数据丢失,这些攻击也可能成功。我们评估了对真实人类基因组的攻击,并得出这样的威胁是完全现实的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号