首页> 外文期刊>Big Data, IEEE Transactions on >Comparison of Different Machine Learning Approaches to Predict Small for Gestational Age Infants
【24h】

Comparison of Different Machine Learning Approaches to Predict Small for Gestational Age Infants

机译:不同机器学习方法的比较预测胎龄婴幼儿

获取原文
获取原文并翻译 | 示例
       

摘要

Diagnosing infants who are small for gestational age (SGA) at early stages could help physicians to introduce interventions for SGA infants earlier. Machine learning (ML) is envisioned as a tool to identify SGA infants. However, ML has not been widely studied in this field. To develop effective SGA prediction models, we conducted four groups of experiments that considered basic ML methods, imbalanced data, feature selection and the time characteristics of variables, respectively. Infants with SGA data collected from 2010 to 2013 with gestational weeks between 24 and 42 were detected. Support vector machine (SVM), random forest (RF), logistic regression (LR) and Sparse LR models were trained on 10-fold cross validation. Precision and the area under the curve (AUC) of the receiver operator characteristic curve were evaluated. For each group, the performance of SVM and Sparse LR was similarly well. LR without any sparsity penalties performed worst, possibly caused by the overfitting problem. With the combination of handling imbalanced data and feature selection, the RF ensemble classifier performed best, which even obtained the highest AUC value (0.8547) with the help of expert knowledge. In other cases, RF performed worse than Sparse LR and SVM, possibly because of fully grown trees.
机译:诊断早期胎龄(SGA)小的婴儿可以帮助医生提前对SGA婴儿的干预措施。设备学习(ML)被设想为识别SGA婴儿的工具。但是,ML尚未在该领域中被广泛研究。为了开发有效的SGA预测模型,我们分别进行了四组实验,分别考虑了基本ML方法,不平衡数据,特征选择以及变量的时间特征。从2010年至2013年收集的SGA数据的婴儿被检测到24和42之间的妊娠期。支持向量机(SVM),随机森林(RF),逻辑回归(LR)和稀疏LR模型在10倍交叉验证上培训。评估了接收器操作员特征曲线的曲线(AUC)下的精度和区域。对于每个组,SVM和稀疏LR的性能同样良好。没有任何稀疏性惩罚的LR表现最差,可能是由过度装备问题引起的。随着处理不平衡数据和特征选择的组合,RF集合分类器最佳地执行,甚至在专业知识的帮助下获得了最高AUC值(0.8547)。在其他情况下,RF比稀疏的LR和SVM表现差,可能是因为完全成长的树木。

著录项

  • 来源
    《Big Data, IEEE Transactions on》 |2020年第2期|334-346|共13页
  • 作者单位

    Beijing Univ Technol Sch Software Engn Beijing Engn Res Ctr IoT Software & Syst Beijing 100124 Peoples R China|Tsinghua Univ Tsinghua Natl Lab Informat Sci & Technol Beijing 100084 Peoples R China;

    Beijing Univ Technol Sch Software Engn Beijing 100124 Peoples R China;

    Beijing Univ Technol Sch Software Engn Beijing 100124 Peoples R China;

    Beijing Univ Technol Sch Software Engn Beijing 100124 Peoples R China;

    Tsinghua Univ Tsinghua Natl Lab Informat Sci & Technol Beijing 100084 Peoples R China;

    Chinese Acad Med Sci & Peking Union Med Coll Peking Union Med Coll Hosp Dept Endocrinol Beijing 100730 Peoples R China;

    Chinese Acad Med Sci & Peking Union Med Coll Peking Union Med Coll Hosp Dept Endocrinol Beijing 100730 Peoples R China;

    Tsinghua Univ Tsinghua Natl Lab Informat Sci & Technol Beijing 100084 Peoples R China;

    Chinese Acad Med Sci & Peking Union Med Coll Peking Union Med Coll Hosp Dept Endocrinol Beijing 100730 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Support vector machines; Pediatrics; Predictive models; Training; Radio frequency; Vegetation; Big data; Feature selection; machine learning; prediction model; small for gestational age;

    机译:支持向量机;儿科;预测模型;训练;射频;植被;大数据;特征选择;机器学习;预测模型;小于胎龄;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号