首页> 外文期刊>Journal of Medical Imaging and Health Informatics >An Anonymous Data Publishing Framework for Streaming Genomic Data
【24h】

An Anonymous Data Publishing Framework for Streaming Genomic Data

机译:用于流媒体基因组数据的匿名数据发布框架

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The sharing of streaming genomic data, which are mainly DNA sequences, necessitates a high degree of protection against potential privacy breaches through the application of data anonymization. Although the currently available hybrid algorithm is considered to be successful for anonymously publishing incremental DNA sequences that come in streams, it is limited in its utility since it can easily overgeneralize the genomic data by generating too many clusters with three sequences. In this paper, we proposes a high-accuracy framework with delayed constraint for preserving the privacy of streaming genomic data. The framework has two possible algorithms: one with suppression and the other without. Unlike the hybrid algorithm, our algorithms generate fewer clusters with three sequences, leading to an increase in utility of the anonymization results. The experimental results demonstrate that the presented framework is more effective than existing algorithms in anonymizing streaming genomic data. Moreover, the high utility of the anonymous data is more beneficial for researchers working toward data mining.
机译:主要是DNA序列的流媒体基因组数据的共享需要通过应用数据匿名化来防止潜在隐私漏洞的高度保护。尽管当前可用的混合算法被认为是成功的,用于匿名发布来自流中的增量DNA序列,但它在其实用程序中受到限制,因为它可以通过产生具有三个序列的太多簇来容易地通过产生太多簇来全面地进行基因组数据。在本文中,我们提出了一种具有延迟约束的高精度框架,用于保留流媒体基因组数据的隐私。该框架有两个可能的算法:一个,一个抑制而另一个没有。与混合算法不同,我们的算法生成具有三个序列的较少的群集,导致匿名结果的效用增加。实验结果表明,所呈现的框架比现有的算法更有效地匿名流媒体基因组数据。此外,匿名数据的高效用对于研究数据挖掘的研究人员更有利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号