首页> 外文会议>IEEE International Conference on Control Science and Systems Engineering >A Novel EEMD-Based Privacy Preserving Approach for Top-k SNPs Query in Genome-Wide Association Studies
【24h】

A Novel EEMD-Based Privacy Preserving Approach for Top-k SNPs Query in Genome-Wide Association Studies

机译:全基因组关联研究中基于前EEMD的Top-k SNP查询的隐私保护新方法

获取原文

摘要

Genome-wide association studies (GWAS) have been a popular method for querying Top-k most significant sets of singe-nucleotide polymorphism locations (SNPs) to discover the genetic factors of diseases. Doctors have been increasing interest to in querying SNPs that significantly associated to diseases. However, the queries and GWAS data share service bring great privacy breach risk to the patients. In traditional Laplace privacy preserving mechanism, privacy budget $arepsilon$is different to be determined, the Laplace noise is added to original data only once, which causes randomness and potential security risk to privacy protection. A novel approach is proposed to implement the differential privacy preserving for Top-k SNPs query in GWAS. Ensemble Empirical Mode Decomposition(EEMD) is a noise-assisted data analysis method, it is firstly proposed to be used as noise-adding approach in privacy protection mechanism. In the EEMD approach, random white noise will be added into the SNPs querying results for many times. Moreover, the parameters of added white noise can be precisely controlled to achieve better effect. PriSTRAT software tool was redeveloped to evaluate EEMD-based privacy preserving approach, the dataset was generated by PLINK tools. Experimental results show that EEMD-based privacy preserving approach has achieved higher accuracy than Laplace mechanism under the same noise-adding strength. The novel approach could be used to realize differential privacy preserving for Top-k SNPs query in GWAS to achieve the balance between privacy preserving and query accuracy.
机译:全基因组关联研究(GWAS)是查询Top-k最重要的单核苷酸多态性位点(SNP)集以发现疾病的遗传因素的流行方法。医生一直对查询与疾病显着相关的SNP越来越感兴趣。但是,查询和GWAS数据共享服务给患者带来了极大的隐私泄露风险。在传统的拉普拉斯隐私保护机制中,隐私预算$ \ varepsilon $是要确定的,拉普拉斯噪声仅被添加到原始数据一次,这对隐私保护造成了随机性和潜在的安全风险。提出了一种新的方法来实现GWAS中Top-k SNP查询的差分隐私保护。集成经验模态分解(EEMD)是一种噪声辅助的数据分析方法,首先被提出作为隐私保护机制中的噪声添加方法。在EEMD方法中,随机白噪声将多次添加到SNP查询结果中。此外,可以精确控制添加的白噪声的参数以达到更好的效果。 PriSTRAT软件工具经过重新开发,以评估基于EEMD的隐私保护方法,数据集由PLINK工具生成。实验结果表明,在相同的噪声添加强度下,基于EEMD的隐私保护方法比Laplace机制具有更高的精度。该新方法可用于实现GWAS中Top-k SNP查询的差分隐私保护,以达到隐私保护与查询准确性之间的平衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号