首页> 外文期刊>Information Systems >Anonymization of moving objects databases by clustering and perturbation
【24h】

Anonymization of moving objects databases by clustering and perturbation

机译:通过聚类和微扰对移动对象数据库进行匿名化

获取原文
获取原文并翻译 | 示例
       

摘要

Preserving individual privacy when publishing data is a problem that is receiving increasing attention. Thanks to its simplicity the concept of k-anonymity, introduced by Samarati and Sweeney [1], established itself as one fundamental principle for privacy preserving data publishing. According to the fc-anonymity principle, each release of data must be such that each individual is indistinguishable from at least k-1 other individuals.rnIn this article we tackle the problem of anonymization of moving objects databases. We propose a novel concept of k-anonymity based on co-localization, that exploits the inherent uncertainty of the moving object's whereabouts. Due to sampling and imprecision of the positioning systems (e.g., GPS), the trajectory of a moving object is no longer a polyline in a three-dimensional space, instead it is a cylindrical volume, where its radius δ represents the possible location imprecision: we know that the trajectory of the moving object is within this cylinder, but we do not know exactly where. If another object moves within the same cylinder they are indistinguishable from each other. This leads to the definition of (k,δ)-anonymity for moving objects databases. We first characterize the (k,δ)-anonymity problem, then we recall NWA (Never Walk Alone), a method that we introduced in [2] based on clustering and spatial perturbation. Starting from a discussion on the limits of NWA we develop a novel clustering method that, being based on EDR distance [3], has the important feature of being time-tolerant. As a consequence it perturbs trajectories both in space and time. The novel method, named W4M (Wait for Me), is empirically shown to produce higher quality anonymization than NWA, at the price of higher computational requirements. Therefore, in order to make W4M scalable to large datasets, we introduce two variants based on a novel (and computationally cheaper) time-tolerant distance function, and on chunking.rnAll the variants of W4M~1 are empirically evaluated in terms of data quality and efficiency, and thoroughly compared to their predecessor NWA.~2 Data quality is assessed both by means of objective measures of information distortion, and by more usability oriented measure, i.e., by comparing the results of (i) spatio-temporal range queries and (ii) frequent pattern mining, executed on the original database and on the (k,δ)-anonymized one.rnExperimental results over both real-world and synthetic mobility data confirm that, for a wide range of values of δ and k, the relative distortion introduced by our anonymization methods is kept low. Moreover, the techniques introduced to make VV4M scalable to large datasets, achieve their goal without giving up data quality in the anonymization process.
机译:发布数据时保护个人隐私是一个日益受到关注的问题。由于其简单性,由Samarati和Sweeney [1]提出的k-匿名性概念将其自身确立为隐私保护数据发布的一项基本原则。根据fc-匿名性原则,每次发布数据时都必须确保每个人都与至少k-1个其他人没有区别。在本文中,我们解决了移动对象数据库的匿名化问题。我们提出了一种基于共定位的k匿名性的新概念,该概念利用了移动物体下落的固有不确定性。由于定位系统(例如GPS)的采样和不精确性,移动物体的轨迹不再是三维空间中的折线,而是一个圆柱体,其半径δ表示可能的位置不精确性:我们知道运动物体的轨迹在此圆柱体内,但是我们不知道确切的位置。如果另一个物体在同一圆柱体内移动,则它们彼此是无法区分的。这导致了移动对象数据库的(k,δ)-匿名性的定义。我们首先描述(k,δ)-匿名问题,然后我们回顾一下NWA(从不走单行),这是我们在文献[2]中基于聚类和空间扰动引入的一种方法。从对NWA限制的讨论开始,我们开发了一种新的聚类方法,该方法基于EDR距离[3],具有耐时间的重要特征。结果,它扰乱了空间和时间上的轨迹。实验证明,这种名为W4M(对我来说是Wait)的新方法可以产生比NWA更高质量的匿名化,但需要更高的计算需求。因此,为了使W4M可扩展到大型数据集,我们基于新颖的(且计算上更便宜的)时间容限距离函数和分块引入了两个变体.rn对W4M〜1的所有变体均根据数据质量进行了经验评估〜2数据质量既可以通过信息失真的客观测量方法,也可以通过以可用性为导向的测量方法进行评估,即通过比较(i)时空范围查询和(ii)在原始数据库和(k,δ)匿名数据库上执行的频繁模式挖掘。现实和合成迁移率数据的实验结果证实,对于很大的δ和k值,由我们的匿名方法引入的相对失真保持较低。此外,引入的使VV4M可扩展到大型数据集的技术可以实现其目标,而不会在匿名化过程中放弃数据质量。

著录项

  • 来源
    《Information Systems》 |2010年第8期|p.884-910|共27页
  • 作者单位

    Department of Computer Engineering, TOBB University of Economics and Technology, Sogutozu, Ankara, Turkey;

    rnYahool Research, Avinguda Diagonal 177, 08018 Barcelona, Spain;

    rnPisa KDD Laboratory, ISTI - CNR, Area della Ricerca di Pisa, Via Giuseppe Moruzzi 1, 56124 Pisa, Italy;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    moving objects databases; trajectories; anonymity; uncertainty; clustering;

    机译:移动物体数据库;轨迹匿名;不确定;聚类;
  • 入库时间 2022-08-18 02:48:01

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号