首页> 外文期刊>Intelligent data analysis >A central tendency-based privacy preserving model for sensitive XML association rules using Bayesian networks
【24h】

A central tendency-based privacy preserving model for sensitive XML association rules using Bayesian networks

机译:使用贝叶斯网络的敏感XML关联规则的基于集中趋势的隐私保护模型

获取原文
获取原文并翻译 | 示例
       

摘要

The rationale of XML design is to transfer and store data at different levels. A key feature of these levels in an XML document is to identify its components for additional processing. XML components can expose sensitive information after application of data mining techniques over a shared database. Therefore, privacy preservation of sensitive information must be ensured prior to signify the outcome especially in sensitive XML Association Rules. Privacy issues in XML domain are not exceptionally addressed to determine a solution by the academia in a reliable and precise manner. In this paper, we have proposed a model for identifying sensitive items (nodes) to declare sensitive XML association rules and then to hide them. Bayesian networks-based central tendency measures are applied in declaration of sensitive XML association rules. K2 algorithm is used to generate Bayesian networks to ensure reliability and accuracy in preserving privacy of XML Association Rules. The proposed model is tested and compared using several case studies and large UCI machine learning datasets. The experimental results show improved accuracy and reliability of proposed model without any side effects such as new rules and lost rules. The proposed model uses the same minimum support threshold to find XML Association Rules from the original and transformed data sources. The significance of the proposed model is to minimize an incredible disclosure risk involved in XML association rule mining from external parties in a competitive business environment.
机译:XML设计的基本原理是在不同级别上传输和存储数据。 XML文档中这些级别的关键功能是确定其组件以进行其他处理。在共享数据库上应用数据挖掘技术之后,XML组件可以公开敏感信息。因此,在表示结果之前,必须确保敏感信息的隐私保护,特别是在敏感的XML关联规则中。 XML领域的隐私问题并未得到特别解决,学术界无法以可靠而精确的方式确定解决方案。在本文中,我们提出了一种用于识别敏感项目(节点)的模型,以声明敏感的XML关联规则,然后将其隐藏。基于贝叶斯网络的集中趋势度量被用于敏感XML关联规则的声明中。 K2算法用于生成贝叶斯网络,以确保在保留XML关联规则的隐私性方面的可靠性和准确性。使用多个案例研究和大型UCI机器学习数据集对提出的模型进行了测试和比较。实验结果表明,所提模型的准确性和可靠性得到了提高,并且没有任何新规则和丢失规则等副作用。提出的模型使用相同的最小支持阈值从原始数据源和转换后的数据源中查找XML关联规则。提出的模型的意义在于,在竞争激烈的商业环境中,最大程度地降低XML关联规则从外部各方挖掘中涉及的令人难以置信的披露风险。

著录项

  • 来源
    《Intelligent data analysis》 |2014年第2期|281-303|共23页
  • 作者单位

    Department of Computer Science, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

    Department of Computer Science, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China;

    Institute of Automation, Chinese Academy of Sciences, Beijing, China;

    College of Computer Sciences and Information Technology, King Faisal University, Saudi Arabia;

    Department of Computer Science, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Privacy preservation; XML; Bayesian networks; sensitive information; sensitive XML association rules;

    机译:隐私保护;XML;贝叶斯网络;敏感信息;敏感的XML关联规则;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号