首页> 外文会议>International FLINS conference >Machine learning using synthetic and real data: Similarity of evaluation metrics for different healthcare datasets and for different algorithms
【24h】

Machine learning using synthetic and real data: Similarity of evaluation metrics for different healthcare datasets and for different algorithms

机译:使用合成数据和真实数据进行机器学习:不同医疗数据集和不同算法的评估指标的相似性

获取原文

摘要

Sharing data is often a risk in terms of security and privacy especially if the data is sensitive. Algorithms can be used to generate synthetic data from an original raw dataset in order to share data that are considered more 'privacy preserving', and that increase the level of anonymity. In this paper, we carry out an experiment to study the validity of conducting machine learning on synthetic data. We compare the evaluation metrics produced from machine learning models that were trained using synthetic data with metrics yielded from machine learning models that were trained using the corresponding real data.
机译:就安全性和隐私性而言,共享数据通常存在风险,尤其是在数据敏感的情况下。可以使用算法从原始原始数据集中生成合成数据,以便共享被认为更“隐私保护”并提高匿名性的数据。在本文中,我们进行了一项实验,以研究对综合数据进行机器学习的有效性。我们将使用合成数据训练的机器学习模型产生的评估指标与使用相应的实际数据训练的机器学习模型产生的指标进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号