首页> 外文会议>International conference on cryptology and network security >A Framework with Randomized Encoding for a Fast Privacy Preserving Calculation of Non-linear Kernels for Machine Learning Applications in Precision Medicine
【24h】

A Framework with Randomized Encoding for a Fast Privacy Preserving Calculation of Non-linear Kernels for Machine Learning Applications in Precision Medicine

机译:用于精确医学中机器学习应用的非线性核的快速隐私保护计算的具有随机编码的框架

获取原文

摘要

For many diseases it is necessary to gather large cohorts of patients with the disease in order to have enough power to discover the important factors. In this setting, it is very important to preserve the privacy of each patient and ideally remove the necessity to gather all data in one place. Examples include genomic research of cancer, infectious diseases or Alzheimer's. This problem leads us to develop privacy preserving machine learning algorithms. So far in the literature there are studies addressing the calculation of a specific function privately with lack of generality or utilizing computationally expensive encryption to preserve the privacy, which slows down the computation significantly. In this study, we propose a framework utilizing randomized encoding in which four basic arithmetic operations (addition, subtraction, multiplication and division) can be performed, in order to allow the calculation of machine learning algorithms involving one type of these operations privately. Among the suitable machine learning algorithms, we apply the oligo kernel and the radial basis function kernel to the coreceptor usage prediction problem of HIV by employing the framework to calculate the kernel functions. The results show that we do not sacrifice the performance of the algorithms for privacy in terms of F1-score and AUROC. Furthermore, the execution time of the framework in the experiments of the oligo kernel is comparable with the non-private version of the computation. Our framework in the experiments of radial basis function kernel is also way faster than the existing approaches utilizing integer vector homomorphic encryption and consequently homomorphic encryption based solutions, which indicates that our approach has a potential for application to many other diseases and data types.
机译:对于许多疾病,有必要聚集大量患有该疾病的患者,以便有足够的能力发现重要因素。在这种情况下,保护每个患者的隐私并从根本上消除在一个地方收集所有数据的必要性非常重要。例子包括癌症,传染病或阿尔茨海默氏症的基因组研究。这个问题导致我们开发隐私保护机器学习算法。迄今为止,在文献中已有一些研究在缺乏通用性的情况下私​​下处理特定功能的计算,或者利用计算量大的加密来保护隐私,这极大地减慢了计算速度。在这项研究中,我们提出了一种利用随机编码的框架,其中可以执行四个基本算术运算(加,减,乘和除),以便允许私有地计算涉及其中一种运算的机器学习算法。在合适的机器学习算法中,我们通过使用框架计算核函数,将寡核和径向基函数核应用于HIV的共受体使用预测问题。结果表明,在F1分数和AUROC方面,我们不牺牲算法的性能。此外,在寡核实验中该框架的执行时间与计算的非私有版本相当。我们在径向基函数内核实验中的框架也比使用整数矢量同态加密并因此基于同态加密的解决方案的现有方法要快得多,这表明我们的方法具有适用于许多其他疾病和数据类型的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号