首页> 外文期刊>American Journal of Theoretical and Applied Statistics >COVID-19 Prediction and Detection Using Machine Learning Algorithms: Catboost and Linear Regression
【24h】

COVID-19 Prediction and Detection Using Machine Learning Algorithms: Catboost and Linear Regression

机译:Covid-19使用机器学习算法预测和检测:Catboost和线性回归

获取原文
           

摘要

A global pandemic COVID-19 has been rapidly spreading, and the predictions for infected rate shows how the cases will increase or decrease. Even though the number of people who get the corona vaccine is increasing, COVID-19 has been a serious worldwide problem. As machine learning and deep learning were implemented to predict COVID-19 in recent days, machine learning to predict the number of confirmed and death cases of COVID-19 was used. Prediction graphs of our proposed model play a crucial role for preventing more people getting infected. The project collected the number of daily infected cases in New York from March 21th 2020 to March 6th 2021. For precise results, the dataset in 6 different kinds of the machine learning methods was used. The methods were Decision Tree, Random Forest, Linear Regression, Gradient Boosting, XGboosting, and LGBM. RMSE and MAE values fluctuated from 9.95 to 68.85 and 5.99 to 58.76. The most accurate model was Linear Regression, RMSE and MAE with 9.96 and 5.99 for death cases and 597.61 and 346.04 for confirmed cases. Therefore, those prediction graph almost matched the same as the real number graph that the project drew with an actual dataset. The other dataset was about common COVID-19 symptoms, and the Catboost model listed from the most influential factor, breathing problem. Collecting data from other areas and specifying the patients' features could have improved the quality of the research, though overall the result was successful.
机译:全球大流行Covid-19一直在迅速传播,对感染率的预测显示了如何增加或减少案例。即使获得电晕疫苗的人数正在增加,Covid-19也是一个严重的全球问题。由于机器学习和深度学习被实施到最近几天内的Covid-19,使用机器学习预测Covid-19的确认和死亡情况的数量。我们拟议模型的预测图对预防更多人受到感染的作用至关重要。该项目于2020年3月21日至2021年3月21日收集了纽约日常感染病例的数量。对于精确的结果,使用了6种不同类型的机器学习方法的数据集。该方法是决策树,随机森林,线性回归,渐变升压,XGBoosting和LGBM。 RMSE和MAE值从9.95波动到68.85和5.99到58.76。最准确的模型是线性回归,RMSE和MAE,9.96和5.99用于死亡病例,597.61和346.04用于确认案件。因此,那些预测图几乎与项目使用实际数据集的实数图相同。另一个数据集是关于常见的Covid-19症状,以及从最具影响力的因素,呼吸问题列出的Catboost模型。从其他地区收集数据并指定患者的特征可以提高研究质量,但总体而言,结果成功。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号