首页> 外文会议>International Conference on Artificial Intelligence in Medicine >SMOOTH-GAN: Towards Sharp and Smooth Synthetic EHR Data Generation
【24h】

SMOOTH-GAN: Towards Sharp and Smooth Synthetic EHR Data Generation

机译:光滑GaN:朝向夏普和平滑的合成EHR数据生成

获取原文

摘要

Generative adversarial networks (GANs) have been highly successful for generating realistic synthetic data. In healthcare, synthetic data generation can be helpful for producing annotated data and improving data-driven research without worries on data privacy. However, electronic health records (EHRs) are noisy, incomplete and complex, and existing work on EHR data is mainly devoted to generating discrete elements such as diagnosis codes and medications or frequent laboratory values. In this work, we propose SMOOTH-GAN, a novel approach for generating reliable EHR data such as laboratory values and medications given diagnosis codes. SMOOTH-GAN takes advantage of a conditional GAN architecture with WGAN-GP loss, and is able to learn transitions between disease stages with high flexibility over data customization. Our experiments demonstrate the model's effectiveness in terms of both statistical similarity and accuracy on machine learning based prediction. To further demonstrate the usage of our model, we apply counterfactual reasoning and generate data with occurrence of multiple diseases, which can provide unique datasets for artificial intelligence driven healthcare research.
机译:生成的对抗性网络(GANS)非常成功地产生现实的合成数据。在医疗保健中,合成数据生成可能有助于生产注释数据和改善数据驱动的研究,而不会担心数据隐私。然而,电子健康记录(EHRS)是嘈杂的,不完整和复杂的,并且在EHR数据上的现有工作主要致力于产生离散元素,例如诊断码和药物或频繁的实验室值。在这项工作中,我们提出了一种流畅的GaN,一种新的方法,用于产生可靠的EHR数据,例如实验室值和药物给予诊断码。光滑GaN利用WAN-GP损失的条件GAN架构,并且能够在数据定制上具有高灵活性的疾病阶段之间的过渡。我们的实验在基于机器学习预测的统计相似性和准确性方面,展示了模型的有效性。为了进一步展示我们模型的使用,我们应用了反事实推理并产生了多种疾病的发生,这可以为人工智能驱动的医疗保健研究提供独特的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号