...
首页> 外文期刊>Information Systems Research >Maximizing Accuracy of Shared Databases when Concealing Sensitive Patterns
【24h】

Maximizing Accuracy of Shared Databases when Concealing Sensitive Patterns

机译:隐藏敏感模式时,最大化共享数据库的准确性

获取原文
获取原文并翻译 | 示例
           

摘要

The sharing of databases either within or across organizations raises the possibility of unintentionally revealing sensitive relationships contained in them. Recent advances in data-mining technology have increased the chances of such disclosure. Consequently, firms that share their databases might choose to hide these sensitive relationships prior to sharing. Ideally, the approach used to hide relationships should be impervious to as many data-mining techniques as possible, while minimizing the resulting distortion to the database. This paper focuses on frequent item sets, the identification of which forms a critical initial step in a variety of data-mining tasks. It presents an optimal approach for hiding sensitive item sets, while keeping the number of modified transactions to a minimum. The approach is particularly attractive as it easily handles databases with millions of transactions. Results from extensive tests conducted on publicly available real data and data generated using IBM's synthetic data generator indicate that the approach presented is very effective, optimally solving problems involving millions of transactions in a few seconds.
机译:在组织内部或组织之间共享数据库增加了无意间揭示其中包含的敏感关系的可能性。数据挖掘技术的最新进展增加了此类披露的机会。因此,共享数据库的公司可能选择在共享之前隐藏这些敏感关系。理想情况下,用于隐藏关系的方法应该不受尽可能多的数据挖掘技术的影响,同时最大程度地减少对数据库的失真。本文着重介绍频繁的项目集,对这些项目集的识别构成了各种数据挖掘任务中至关重要的初始步骤。它提供了一种隐藏敏感项目集的最佳方法,同时将修改后的事务数量保持在最低水平。该方法特别有吸引力,因为它可以轻松处理具有数百万个事务的数据库。对可公开获得的真实数据和使用IBM的综合数据生成器生成的数据进行的广泛测试的结果表明,所提供的方法非常有效,可以在几秒钟内以最佳方式解决涉及数百万笔交易的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号