首页> 外文会议>International Conference on Pattern Recognition and Machine Learning >Tree-Based Models Using Random Grid Search Optimization for Disease Classification Based on Environmental Factors: A Case Study on Asthma Hospitalizations
【24h】

Tree-Based Models Using Random Grid Search Optimization for Disease Classification Based on Environmental Factors: A Case Study on Asthma Hospitalizations

机译:基于树的模型,基于环境因素的疾病分类优化:哮喘住院案例研究

获取原文

摘要

An understanding on the exposure to environmental factors aggravating global disease burden can aid mitigating it. Generally, a class of generalized linear models and generalized additive models are used in predicting disease burden whereas, tree-based models are underused. The objective of this paper is to evaluate the performance of different tree-based models namely decision tree, random forest, gradient boosted tree and stochastic gradient boosted trees in predicting asthma attack based on short-term exposure to environmental factors and to examine the environmental factors triggering asthma attack. A sample of patients during 2013 - 2015 from different parts of Victoria was considered. The study area for the considered study period had reasonably good air quality and relatively humid environment. The tree-based models were tuned using random grid search optimization with bootstrapping to address over-fitting. The models considered performed well in predicting asthma attacks in terms of area under the receiver operating curve (ROC AUC) (>0.82). All the gradient boosted trees (accuracy = 76%; recall = 63%; F2-score = 64%) showed better overall prediction whereas decision tree (accuracy = 71%; recall = 75%; F2-score = 71%) outperformed other models in identifying the positive cases. Tree-based models revealed that O3 exposure consistently influence Asthma. Further, decision tree revealed O3 exposure < 13 ppb or with high O3 exposure >= 13 ppb, and with [SO2 exposure < 0.5 ppb and maximum wind speed > 5.4. km/hr.] influenced Asthma. In addition, relative humidity and exposure to CO were also detected in other tree-based models as relevant predictors triggering asthma attacks.
机译:对暴露于全球疾病负担的环境因素的接触可以帮助减轻它。通常,一类广义的线性模型和广义添加剂模型用于预测疾病负担,而基于树的模型。本文的目的是评估不同树的模型的性能即决策树,随机森林,梯度提升树和随机梯度提高树木在预测基于短期暴露于环境因素并检查环境因素的哮喘攻击触发哮喘发作。考虑了从维多利亚不同地区的2013年患者样本。考虑研究期的研究领域具有合理的空气质量和相对潮湿的环境。使用随机网格搜索优化进行调整基于树的模型,以便引导以解决过度拟合。在接收器操作曲线(ROC AUC)下的区域(ROC AUC)(> 0.82)下预测哮喘攻击时,考虑的模型良好。所有梯度提高树木(精度= 76%;召回= 63%; f 2 -score = 64%)显示出更好的整体预测,而决策树(精度= 71%;召回= 75%; f 2 -score = 71%)表现出识别阳性情况时的其他模型。基于树的模型显示o 3 暴露始终如一地影响哮喘。此外,决策树揭示了o 3 暴露<13 ppb或高o 3 曝光> = 13 ppb,并且[所以 2 暴露<0.5 ppb和最大风速> 5.4。 km / hr。]影响哮喘。此外,还在其他基于树的模型中检测到相对湿度和对CO的接触,作为触发哮喘攻击的相关预测因子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号