Not a Free Lunch, But a Cheap One:On Classifiers Performance on Anonymized Datasets

机译：不是免费的午餐，而是便宜的午餐：关于分类器在匿名数据集上的性能

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The problem of protecting datasets from the disclosure of confidential information, while published data remains useful for analysis, has recently gained momentum. To solve this problem, anonymization techniques such as k-anonymity, ℓ-diversity, and i-closeness have been used to generate anonymized datasets for training classifiers. While these techniques provide an effective means to generate anonymized datasets, an understanding of how their application affects the performance of classifiers is currently missing. This knowledge enables the data owner and analyst to select the most appropriate classification algorithm and training parameters in order to guarantee high privacy requirements while minimizing the loss of accuracy. In this study, we perform extensive experiments to verify how the classifiers performance changes when trained on an anonymized dataset compared to the original one, and evaluate the impact of classification algorithms, datasets properties, and anonymization parameters on classifiers' performance.

机译：保护数据集不被泄露机密信息的问题，虽然公布的数据对分析仍然有用，但最近得到了发展。为了解决这个问题，匿名技术，比如k-匿名，ℓ-多样性和i-贴近度已用于生成用于训练分类器的匿名数据集。虽然这些技术提供了一种生成匿名数据集的有效方法，但目前尚不清楚它们的应用如何影响分类器的性能。这些知识使数据所有者和分析人员能够选择最合适的分类算法和训练参数，以保证高隐私要求，同时最大限度地减少准确性损失。在本研究中，我们进行了大量实验，以验证与原始数据集相比，在匿名数据集上训练分类器时，分类器的性能如何变化，并评估分类算法、数据集属性和匿名参数对分类器性能的影响。

著录项

来源
《Annual IFIP WG11.3 Conference on Data and Applications Security and Privacy》|2021年|237-258|共22页
会议地点
作者
Mina Alishahi; Nicola Zannone;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Privacy-preserving; k-anonymity; ℓ-diversity; t-closeness; Classifiers comparison;

机译：保护隐私;k-匿名性;ℓ-差异t-贴近度;量词比较;

相似文献

外文文献
中文文献
专利

1. High Performance Frequent Subgraph Mining on Transaction Datasets: A Survey and Performance Comparison [J] . Bismita S.Jena, Cynthia Khan, Rajshekhar Sunderraman 大数据挖掘与分析(英文) . 2019,第003期
2. No free lunch but a cheaper supper: A general framework for streaming anomaly detection [J] . Calikus Ece, Nowaczyk Slawomir, SantAnna Anita, Expert systems with applications . 2020,第Octa期

机译：没有免费的午餐，但是一个更便宜的晚餐：流动异常检测的一般框架
3. Is a 'free lunch' a good lunch? The performance of zero wholesale price-based supply-chain contracts [J] . European Journal of Operational Research . 2020,第1期

机译：是一个午餐的“免费午餐”？零批发价格的供应链合同的性能
4. Performance Analysis of Machine Learning Classifiers for Intrusion Detection using UNSW-NB15 Dataset [J] . Geeta Kocher, Gulshan Kumar Computer Science & Information Technology . 2020,第20期

机译：使用UNSW-NB15数据集进行入侵检测机器学习分类器的性能分析
5. A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets [C] . M. Uriz, D. Paternain, H. Bustince, IEEE International Conference on Fuzzy Systems . 2018

机译：利用分类器性能为分类器集合创建模糊度量的第一种方法：以高度不平衡的数据集为例
6. Free Lunch: Healthfulness and Sustainability of Free Meals Provided in the Tech Workplace [D] . Marchini, Katlyn Michelle. 2017

机译：免费午餐：技术工作场所提供的免费餐点的健康性和可持续性
7. Weak data do not make a free lunch only a cheap meal [O] . Zhipu Luo, Kanagalaghatta Rajashankar, Zbigniew Dauter -1

机译：数据不足不能免费午餐只能便宜一顿饭
8. Weak data do not make a free lunch, only a cheap meal [O] . Zhipu Luo, Kanagalaghatta Rajashankar, Zbigniew Dauter 2014

机译：弱数据不会享用免费午餐，只需一顿便宜的饭菜
9. Identification and Optimization of Classifier Genes from Multi-Class Earthworm Microarray Dataset [R] . 2010

机译：多类蚯蚓微阵列数据集分类器基因的鉴定与优化

Not a Free Lunch, But a Cheap One:On Classifiers Performance on Anonymized Datasets

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅