首页> 外文会议>IEEE International Conference on Application of Information and Communication Technologies >Learning the Relationship between Asthma and Meteorological Events by Using Machine Learning Methods
【24h】

Learning the Relationship between Asthma and Meteorological Events by Using Machine Learning Methods

机译:通过机器学习方法学习哮喘与气象事件之间的关系

获取原文

摘要

In this article, a new methodology is proposed by using the relationships between meteorological events and asthma cases of asthma patients in a region compared to other regions in a country. We focus on the impact of weather conditions on asthma in order to estimate asthma cases using machine learning methods based on meteorological events only. In order to increase the success of the estimates, in addition to the 10 features identified by the National Environmental Information Centers, we create some new semi-synthetic features by using the multiplication and addition operations on the features given after the scaling. Then, we use machine learning methods and the R-square coefficient approach to learn the effective features using the features obtained from publicly available data sets for Russia. After determining the effective features, we use three different machine learning algorithms: random forest, linear regression, and kernel ridge regression algorithms. We use transfer learning to store effective features obtained from a dataset for Russia and then apply them to a dataset for Kazakhstan. Our hypothesis is that a combination of the selected semi-synthetic properties of the random forest algorithm has the best performance accuracy for this application. The model successfully identifies (predicts) very high, high, medium, low or very low numbers of people with asthma for the first time in the region.
机译:在本文中,通过使用一个地区与一个国家的其他地区相比,气象事件与该地区哮喘患者的哮喘病例之间的关系,提出了一种新的方法。我们专注于天气条件对哮喘的影响,以便仅基于气象事件使用机器学习方法估算哮喘病例。为了增加估算的成功率,除了国家环境信息中心确定的10个特征外,我们还对定标后给出的特征使用了乘法和加法运算,从而创建了一些新的半合成特征。然后,我们使用机器学习方法和R平方系数方法,使用从俄罗斯公开数据集获得的特征来学习有效特征。确定有效特征后,我们使用三种不同的机器学习算法:随机森林,线性回归和核岭回归算法。我们使用转移学习来存储从俄罗斯数据集获得的有效要素,然后将其应用于哈萨克斯坦的数据集。我们的假设是,随机森林算法的选定半合成属性的组合对于此应用程序具有最佳性能精度。该模型首次成功识别(预测)了该地区非常高,高,中,低或非常低的哮喘患者人数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号