首页> 外文会议>Annual IFIP WG11.3 Conference on Data and Applications Security and Privacy >Not a Free Lunch, But a Cheap One:On Classifiers Performance on Anonymized Datasets
【24h】

Not a Free Lunch, But a Cheap One:On Classifiers Performance on Anonymized Datasets

机译:不是免费的午餐,而是便宜的午餐:关于分类器在匿名数据集上的性能

获取原文

摘要

The problem of protecting datasets from the disclosure of confidential information, while published data remains useful for analysis, has recently gained momentum. To solve this problem, anonymization techniques such as k-anonymity, ℓ-diversity, and i-closeness have been used to generate anonymized datasets for training classifiers. While these techniques provide an effective means to generate anonymized datasets, an understanding of how their application affects the performance of classifiers is currently missing. This knowledge enables the data owner and analyst to select the most appropriate classification algorithm and training parameters in order to guarantee high privacy requirements while minimizing the loss of accuracy. In this study, we perform extensive experiments to verify how the classifiers performance changes when trained on an anonymized dataset compared to the original one, and evaluate the impact of classification algorithms, datasets properties, and anonymization parameters on classifiers' performance.
机译:保护数据集不被泄露机密信息的问题,虽然公布的数据对分析仍然有用,但最近得到了发展。为了解决这个问题,匿名技术,比如k-匿名,ℓ-多样性和i-贴近度已用于生成用于训练分类器的匿名数据集。虽然这些技术提供了一种生成匿名数据集的有效方法,但目前尚不清楚它们的应用如何影响分类器的性能。这些知识使数据所有者和分析人员能够选择最合适的分类算法和训练参数,以保证高隐私要求,同时最大限度地减少准确性损失。在本研究中,我们进行了大量实验,以验证与原始数据集相比,在匿名数据集上训练分类器时,分类器的性能如何变化,并评估分类算法、数据集属性和匿名参数对分类器性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号