...
首页> 外文期刊>European Journal of Operational Research >Minimum-distance controlled perturbation methods for large-scale tabular data protection
【24h】

Minimum-distance controlled perturbation methods for large-scale tabular data protection

机译:用于大规模表格数据保护的最小距离控制摄动方法

获取原文
获取原文并翻译 | 示例
           

摘要

National Statistical Agencies routinely release large amounts of tabular information. Prior to dissemination, tabular data needs to be processed to avoid the disclosure of individual confidential information. One widely used class of methods is based on the modification of the table cells values. However, previous approaches were not able to preserve the values of the marginal cells and the additivity relations for a general table of any dimension, size and structure. This void was recently filled by the controlled tabular adjustment and one of its variants, the quadratic minimum-distance controlled perturbation method. Although independently developed, both approaches rely on the same strategy: given a set of tables to be protected, they find the minimum-distance values to the original cells that make the released information safe. Controlled tabular adjustment uses the L-1 distance; the quadratic minimum-distance variant considers L-2. This work presents both approaches within an unified framework, and includes a new variant based on L-infinity. Among other benefits, the unified framework permits the simple comparison of the three distances, and a single general result about their disclosure risk. The three distances are evaluated with the unique standard library for tabular data protection currently available. Some of the complex instances were contributed by National Statistical Agencies, and, therefore, are good representatives of theirs real needs. Unlike alternative methods, the three distances were able to solve all the instances, requiring only few seconds for each of them on a personal computer using a general purpose solver. The results show that this class of methods are an effective and promising toot for the protection of large volumes of tabular data. All the linear and quadratic problems solved in the paper are delivered to the optimization community in MPS format. (c) 2004 Elsevier B.V. All rights reserved.
机译:国家统计局通常会发布大量表格信息。在分发之前,需要处理表格数据,以避免泄露个人机密信息。一类广泛使用的方法是基于表单元格值的修改。但是,以前的方法无法保留任何尺寸,大小和结构的通用表的边缘单元格的值和加性关系。最近,这种空隙被受控的表格调整及其变体之一,即二次最小距离受控扰动方法所填补。尽管是独立开发的,但这两种方法都依赖于相同的策略:给定一组要保护的表,它们会找到与原始单元格的最小距离值,从而使发布的信息安全。受控表格调整使用L-1距离;二次最小距离变式考虑L-2。这项工作在统一的框架内介绍了这两种方法,并包括一个基于L-infinity的新变体。除其他好处外,统一框架还允许对三个距离进行简单比较,并获得有关其披露风险的单一总体结果。使用当前可用的表格式数据保护的唯一标准库评估这三个距离。一些复杂的实例是由国家统计机构提供的,因此可以很好地代表其实际需求。与其他方法不同,这三个距离能够求解所有实例,在使用通用求解器的个人计算机上,每个实例仅需要几秒钟的时间。结果表明,此类方法是保护大量表格数据的有效且有前途的嘟嘟声。本文解决的所有线性和二次问题都以MPS格式提供给优化社区。 (c)2004 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号