首页> 外文期刊>Future generation computer systems >Top-κ closed co-occurrence patterns mining with differential privacy over multiple streams
【24h】

Top-κ closed co-occurrence patterns mining with differential privacy over multiple streams

机译:Top-κ封闭的共同发生模式采用多个流的差异隐私

获取原文
获取原文并翻译 | 示例
           

摘要

The frequent pattern mining over data streams is a very important problem for many applications. However, many researches investigate a single stream in which every transaction is independent and it is not considered that some transactions are generated by the same individual. Some real-world applications involve multiple streams that continuously generate objects, and interesting observations are the objects appearing in many streams, such as emerging topic discovery, e-commerce, web usage pattern mining and location-based services. In this paper, we analyze the privacy problems in mining top-κ closed co-occurrence patterns over multiple streams caused by single release of a window and continuous releases in successive windows. To prevent privacy leakage, we propose a differentially private top-κclosed co-occurrence patterns mining algorithm across multiple streams with exponential mechanism and Laplace mechanism. The algorithm consists of dissimilarity calculation phase and differentially private mining phase, where differentially private mining phase includes adjusting CP-Graph with splitting transaction, perturbing CP-Graph to obtain the top-κ closed co-occurrence patterns candidate set and adding noise to the supports of patterns. Finally, we prove our algorithm satisfies differential privacy and experiment results show the utility and efficiency of our algorithm.
机译:频繁的模式挖掘数据流是许多应用程序的一个非常重要的问题。但是,许多研究调查了每条交易是独立的单一流,并且不认为某些事务是由同一个人生成的。一些现实世界应用程序涉及多个流,该流连续生成对象,有趣的观察是出现在许多流中的对象,例如新兴主题发现,电子商务,网络使用模式挖掘和基于位置的服务。在本文中,我们分析了在连续窗口中单个释放和连续发布引起的多个流中挖掘顶级κ封闭的共发生模式的隐私问题。为了防止隐私泄漏,我们提出了一种跨多个流的差异私有的Top-κ共发生模式挖掘算法,具有指数机制和拉普拉斯机制。该算法由不同的计算阶段和差异私有挖掘阶段组成,其中差异私有挖掘阶段包括调整带有分裂事务的CP-曲线图,扰动CP-曲线图以获得顶部κ封闭的共同发生模式候选集并向支撑件添加噪声模式。最后,我们证明了我们的算法满足差异隐私和实验结果表明我们算法的效用和效率。

著录项

  • 来源
    《Future generation computer systems》 |2020年第10期|339-351|共13页
  • 作者单位

    Guangxi Key Lab of Multi-source Information Mining and Security Guangxi Normal University Guilin China School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

    School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

    School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

    School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

    Guangxi Key Lab of Multi-source Information Mining and Security Guangxi Normal University Guilin China School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

    Guangxi Key Lab of Multi-source Information Mining and Security Guangxi Normal University Guilin China School of Computer Science and Information Engineering Guangxi Normal University Guilin China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Differential privacy; Multiple streams; Co-occurrence patterns; Data mining; Sliding window;

    机译:差异隐私;多个流;共同发生模式;数据挖掘;滑动窗口;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号