A sanitization approach for big data with improved data utility

首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >A sanitization approach for big data with improved data utility

【24h】

A sanitization approach for big data with improved data utility

机译：改进数据实用程序的大数据的消毒方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The process of collaborative data mining may sometimes expose the sensitive patterns present inside the data which may be undesirable to the data owner. Sensitive Pattern Hiding (SPH) is a subfield of data mining that addresses this problem. However, most of the existing approaches used for hiding sensitive patterns cause high side-effect on non-sensitive patterns which in-turn reduces the utility of the sanitized dataset. Furthermore, most of them are sequential in nature and are not able to cope with massive amounts of data and often results in high execution time. To resolve these identified challenges of utility and non-feasibility, two parallelized approaches have been proposed named PGVIR and PHCR based on spark parallel computing framework which modifies the data such that no sensitive patterns can be extracted while maintaining the utility of the sanitized dataset. Experiments performed using benchmark dataset shows that PGVIR scales better and PHCR causes fewer side-effects to the data compared to the existing techniques.

机译：协同数据挖掘的过程有时可能暴露在数据所有者中可能不期望的数据内的敏感模式。敏感图案隐藏（SPH）是解决此问题的数据挖掘的子字段。然而，用于隐藏敏感图案的大多数现有方法导致对非敏感模式的高副作用，从而减少了消毒数据集的效用。此外，大多数在性质上是连续的，并且无法应对大量数据，并且通常会导致高执行时间。为了解决这些识别的实用性和不可行性的挑战，已经提出了基于火花并行计算框架的PGVIR和PHCR提出了两个并行化方法，该PGVIR计算框架修改了数据，使得可以在维护消毒数据集的实用程序的同时没有提取敏感模式。使用基准数据集执行的实验表明，与现有技术相比，PGVIR尺度更好，PHCR导致数据较少副作用。

著录项

来源
《Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies》 |2020年第7期|共15页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Big data; Data utility; Parallel processing; Privacy preservation; Sensitive patterns; Spark;

机译：大数据;数据实用程序;并行处理;隐私保存;敏感模式;火花;

相似文献

外文文献
中文文献
专利

1. A sanitization approach for big data with improved data utility [J] . Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2020,第7期

机译：改进数据实用程序的大数据的消毒方法
2. Privacy-preserving data publishing based on sanitized probability matrix using transactional graph for improving the security in medical environment [J] . Saranya K., Premalatha K. Journal of supercomputing . 2020,第8期

机译：基于Sanitized概率矩阵使用事务图来提高医疗环境安全的隐私保留数据发布
3. A sanitization approach for privacy preserving data mining on social distributed environment [J] . Lekshmy P. L., Rahiman M. Abdul Journal of ambient intelligence and humanized computing . 2020,第7期

机译：隐私保留数据挖掘对社会分布式环境的消毒方法
4. Utility of Knowledge Extracted from Unsanitized Data when Applied to Sanitized Data [C] . Michal Sramka, Reihaneh Safavi-Naini, Jorg Denzinger, Annual Conference on Privacy, Security and Trust . 2008

机译：应用于消毒数据时，从不合格数据提取的知识的效用
5. An Iterative Approach to Examining the Effectiveness of Data Sanitization [D] . Singh, Anhad Preet 2015

机译：一种检验数据清理有效性的迭代方法
6. Public health utility of cause of death data: applying empirical algorithms to improve data quality [O] . Sarah Charlotte Johnson, Matthew Cunningham, Ilse N. Dippenaar, 2021

机译：死亡原因的公共卫生实用性：应用实证算法以提高数据质量
7. Data Sanitization: Improving the Forensic Utility of Anomaly Detection Systems [O] . Cretu Gabriela F., Stavrou Angelos, Stolfo Salvatore, 2007

机译：数据消毒：提高异常检测系统的法证效用
8. A STUDY TO DEVELOP IMPROVED SPACECRAFT SNOW SURVEY METHODS USING SKYLAB/EREP DATA, DEMONSTRATION OF THE UTILITY OF THE S190 AND S192 DATA [R] . James C. Barnes, Clinton J. Bowley, Michael D. Smallwood 1974

机译：利用sKYLaB / EREp数据开发改进的航天器雪测量方法的研究，对s190和s192数据的实用性进行了演示

A sanitization approach for big data with improved data utility

摘要

著录项

相似文献

相关主题

期刊订阅