PUBLISHING SENSITIVE TIME-SERIES DATA UNDER PRESERVATION OF PRIVACY AND DISTANCE ORDERS

Mi-Jung Choi; Hea-Suk Kim; Yang-Sae Moon

Abstract

In this paper, we address the problem of preserving mining accuracy as well as privacy in publishing sensitive time-series data. For example, people with heart disease do not want to disclose their ECG time-series, but they may still allow some accurate patterns to be mined from those time-series. Our privacy model assumes that (1) data sources publish their time-series independently, and (2) all information used in publishing time-series can be publicly revealed. Based on this model, we introduce three assumptions: full disclosure, equi-uncertainty, and independency. We also derive two requirements: uncertainty preservation and distance order preservation. We show that only randomization methods satisfy all three assumptions, but even those methods do not satisfy both requirements. Thus, we discuss randomization-based solutions that satisfy all assumptions and requirements. For this purpose, we present a novel notion, the noise averaging effect of piecewise aggregate approximation (PAA), which is derived from the simple intuition that the summation of random noise converges to 0. This noise averaging effect can alleviate the problem of destroyed distance orders in randomly perturbed time-series. Based on the noise averaging effect, we first propose two naive solutions that use random data perturbation in publishing time-series while exploiting the PAA distance in computing distances. There is, however, a tradeoff between these two solutions with respect to uncertainty and distance orders. We thus propose three more advanced solutions that take advantage of both naive solutions. Experimental results show that our advanced solutions are superior to the naive solutions in the preservation of uncertainty, distance orders, and clustering accuracy.
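The noise averaging effect mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (paa, paa_distance, perturb, noise_averaging_demo), the uniform zero-mean noise model, and the series length and segment count are all assumptions chosen for illustration. The distance uses the commonly cited PAA form, sqrt(n / N) times the Euclidean distance between the N segment means; whether the paper uses exactly this form is also an assumption.

```python
import math
import random


def paa(series, segments):
    """Piecewise aggregate approximation: the mean of each equal-length segment."""
    n = len(series)
    if n % segments != 0:
        raise ValueError("series length must be divisible by the segment count")
    width = n // segments
    return [sum(series[i:i + width]) / width for i in range(0, n, width)]


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def paa_distance(a, b, segments):
    """Commonly cited PAA distance: sqrt(n / segments) * ||PAA(a) - PAA(b)||."""
    scale = math.sqrt(len(a) / segments)
    return scale * euclidean(paa(a, segments), paa(b, segments))


def perturb(series, amplitude, rng):
    """Add independent zero-mean uniform noise to every sample (assumed noise model)."""
    return [x + rng.uniform(-amplitude, amplitude) for x in series]


def noise_averaging_demo(length=1024, segments=32, amplitude=1.0, seed=0):
    """Return (mean |noise| per sample, mean |noise| per PAA segment)."""
    rng = random.Random(seed)
    # Perturbing an all-zero series leaves only the noise itself.
    noise = perturb([0.0] * length, amplitude, rng)
    raw = sum(abs(v) for v in noise) / length
    averaged = sum(abs(v) for v in paa(noise, segments)) / segments
    return raw, averaged


if __name__ == "__main__":
    raw, averaged = noise_averaging_demo()
    print("mean |noise| per sample:     ", raw)
    print("mean |noise| per PAA segment:", averaged)
```

Under these assumptions, the per-segment noise is much smaller than the per-sample noise, because the noise values inside each segment largely cancel when averaged. This suggests why distances computed on PAA values of perturbed series may keep their relative order better than distances computed on the perturbed samples directly.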

Bibliographic details

Source: International Journal of Innovative Computing Information and Control, 2012, Issue 5b, pp. 3619-3638 (20 pages)
Authors: Mi-Jung Choi; Hea-Suk Kim; Yang-Sae Moon
Affiliation (all authors): Department of Computer Science, Kangwon National University, 192-1 Hyoja2-Dong, Chunchon, Kangwon 200-701, Republic of Korea
Original format: PDF
Language: English
Keywords: time-series data; privacy preservation; data mining; clustering; distance orders

