首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >Improving Young Stroke Prediction by Learning with Active Data Augmenter in a Large-Scale Electronic Medical Claims Database
【24h】

Improving Young Stroke Prediction by Learning with Active Data Augmenter in a Large-Scale Electronic Medical Claims Database

机译:通过使用大型电子医疗理赔数据库中的活动数据增强器学习来改善年轻中风的预测

获取原文

摘要

Electronic medical claims (EMC) database has been successfully used for predicting occurrences of stroke and a variety of other diseases. However, inadequate predictive performances have been observed in cases of rare occurrences due to both insufficient training samples and highly imbalanced class distribution. In this work, our aim is to improve stroke prediction, especially for young age group (25-45 year-old) in a large population-based EMC database (552,898 subjects). We learn a young stroke predictive deep neural network model using a novel active data augmenter. The augmenter selects the most informative EHR data samples from old age stroke patients. This approach achieves 9.3% and 8.2% area under the receiver operating characteristic curve (AUC) value improvements compared to training directly with only young age group data and training all age groups data, respectively. We further provide analyses on the AUC values obtained as a function of the training data size, and the amount and the type of augmented data samples.
机译:电子医疗索赔(EMC)数据库已成功用于预测中风和多种其他疾病的发生。然而,由于训练样本不足和班级分布高度失衡,在罕见情况下,观察到的预测性能不足。在这项工作中,我们的目的是在基于人群的大型EMC数据库(552,898名受试者)中改善卒中预测,尤其是针对年龄在25-45岁之间的年轻人。我们使用新型主动数据增强器学习了年轻的中风预测性深度神经网络模型。增强器从老年卒中患者中选择最有用的EHR数据样本。与仅使用年轻年龄组数据进行训练和分别对所有年龄组数据进行训练相比,此方法在接收器工作特性曲线(AUC)值改进下可实现9.3%和8.2%的面积。我们进一步提供了关于获得的AUC值的分析,该AUC值是训练数据大小,增强数据样本的数量和类型的函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号