Privacy-Preserving K-Means Clustering Upon Negative Databases

机译：保留隐私k-means群集在负数据库时

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data mining has become very popular with the arrival of big data era, but it also raises privacy issues. Negative database (NDB) is a new type of data representation which stores the negative image of data and can protect privacy while supporting some basic data mining operations such as classification and clustering. However, the existing clustering algorithm upon NDBs is based on Hamming distance, when facing datasets which have many categories for each attribute, the encoded data will become very long and resulting in low computational efficiency. In this paper, we propose a privacy-preserving k-means clustering algorithm based on Euclidean distance upon NDBs. The main step of k-means algorithm is to calculate the distance between each record and cluster centers, in order to solve the problem of privacy disclosure in this step, we transform each record in database into an NDB and propose a method to estimate Euclidean distance from a binary string and an NDB. Our work opens up new ideas for data mining upon negative database.

机译：数据挖掘已经变得非常受到大数据时代的到来，但它也提出了隐私问题。否定数据库（NDB）是一种新的数据表示，其存储数据的负图像，并且可以保护隐私，同时支持一些基本数据挖掘操作，例如分类和聚类。然而，在NDB上的现有聚类算法基于汉明距离，当面对每个属性具有许多类别的数据集时，编码数据将变得非常长并且导致计算效率低。在本文中，我们提出了一种基于NDBS的欧几里德距离的隐私保留的K-Means聚类算法。 K-means算法的主要步骤是计算每个记录和集群中心之间的距离，以解决本步骤中的隐私披露问题，我们将数据库中的每个记录转换为NDB，并提出一种估计欧几里德距离的方法来自二进制字符串和NDB。我们的工作为负数据库的数据挖掘开辟了新的想法。

著录项

来源
《International Conference on Neural Information Processing》|2018年|699p|共14页
会议地点
作者
Xiaoyi Hu; Liping Lu; Dongdong Zhao; Jianwen Xiang; Xing Liu; Haiying Zhou; Shengwu Xiong; Jing Tian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP183-53;
关键词
Privacy protection; Data mining; Negative database; K-means clustering;

机译：隐私保护;数据挖掘;负数据库;K-mears聚类;

相似文献

外文文献
中文文献
专利

1. Efficient and Privacy-Preserving Multi-User Outsourced K-Means Clustering [J] . Na Li, Lianguan Huang, Yanling Li, Computer and Information Science . 2021,第2期

机译：高效和隐私保留的多用户外包K-means群集
2. Efficient and Privacy-Preserving Multi-User Outsourced K-Means Clustering [J] . Na Li, Lianguan Huang, Yanling Li, Computer and information science . 2021,第2期

机译：高效和隐私保留的多用户外包K-means群集
3. Efficient two-party privacy-preserving collaborative k-means clustering protocol supporting both storage and computation outsourcing [J] . Information Sciences: An International Journal . 2020,第期

机译：高效的双方隐私保留协作K-Means群集协议支持存储和计算外包
4. A Fine-grained Privacy-preserving k-means Clustering Algorithm Upon Negative Databases [C] . Dongdong Zhao, Xiaoyi Hu, Shengwu Xiong, IEEE Symposium Series on Computational Intelligence . 2019

机译：基于负数据库的细粒度隐私保护k均值聚类算法
5. Automated Parsing of Flexible Molecular Systems Using Principal Component Analysis and K-Means Clustering Techniques [D] . Nwerem, Matthew Jonathan Chukwunenye. 2021

机译：使用主成分分析和K-Means聚类技术自动解析灵活分子系统
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Practical Privacy-Preserving K-means Clustering [O] . Payman Mohassel, Mike Rosulek, Ni Trieu 2020

机译：实用隐私保留k-means集群

Privacy-Preserving K-Means Clustering Upon Negative Databases

摘要

著录项

相似文献

相关主题

期刊订阅