Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data

Shashidhar Virupaksha; D. Venkatesulu

首页> 外文期刊>International Journal of Computers & Applications >Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data

【24h】

Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data

机译：Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

Clustering is a data mining technique that has been effectively used in the last few decades for knowledge extraction. Privacy is a major problem while releasing data for clustering and therefore privacy-preserving data mining (PPDM) algorithms have been developed. Aggregation is a popular PPDM technique that has been used. However, in the last few years, certain applications require that data mining be performed on high-dimensional data. The present privacy preservation techniques perform aggregation in a univariate manner along each dimension. This affects the utility measures, information measures, and especially retention of original clusters. This paper proposes a new technique called as subspace-based aggregation (SBA). SBA categorizes the dimensions into dense and non-dense subspaces based on the density of points. Aggregation is performed separately for dense and non-dense subspaces. This approach helps to maximize utility measures, information measures, and retention of clusters. SBA is run on high-dimensional continuous datasets from UCI Machine Learning repository. SBA is compared with related work methods such as SINGLE, SIMPLE, MDAV, and PPPCA. SBA provides an improvement of 66 in utility, 400 in cluster identification, 5 in co-variance, and standard deviation.

著录项

来源
《International Journal of Computers & Applications》 |2022年第12期|1130-1139|共10页
作者
Shashidhar Virupaksha; D. Venkatesulu;
展开▼
作者单位

Department of CSE, VFSTR (Deemed to be University), Guntur, India,Department of CSE, Presidency University, Bengaluru, India;

Department of CSE, VFSTR (Deemed to be University), Guntur, India;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类计算机的应用;
关键词
Privacy preservation; privacy preserved data mining; Data privacy;

Subspace-based aggregation for enhancing utility, information measures, and cluster identification in privacy preserved data mining on high-dimensional continuous data

摘要

著录项

相关主题

期刊订阅