【24h】

CatBoost Ensemble Approach for Diabetes Risk Prediction at Early Stages

机译:早期阶段的糖尿病风险预测的Catboost合奏方法

获取原文

摘要

Diabetes prediction at the early stage is an important issue in the healthcare field and helps an individual to avoid dangerous situations by initiating treatment. For the prediction of diabetes at the early stages, many techniques in the area of machine learning and ensemble learning have been used. In this paper, we propose an ensemble technique CatBoost which is a Gradient Boosting Decision Tree (GBDT) for diabetes prediction at early stages. The experiment is conducted by comparing the performance of CatBoost with other machine learning methods such as K-Nearest neighbor, Multi-layer perceptron, Logistic regression, Gaussian Naive Bayes, and Stochastic gradient descent and the result is evaluated using accuracy, precision, recall, f1-score, and AUC-ROC curve. Experimentation is conducted using the dataset available in the UCI machine learning repository named “Early stage diabetes risk prediction”. The results prove that CatBoost outperforms compared to the other machine learning methods.
机译:早期阶段的糖尿病预测是医疗保健领域的一个重要问题,并通过启动治疗帮助个人避免危险情况。为了预测早期阶段的糖尿病,已经使用了机器学习领域的许多技术和集合学习。在本文中,我们提出了一种集合技术Catboost,其是早期阶段的糖尿病预测的渐变升压决策树(GBDT)。通过比较Catboost与其他机器学习方法的性能进行实验,如K-Collest邻居,多层感知,Logistic回归,高斯天真贝叶斯和随机梯度下降,并且使用精度,精度,召回评估结果, F1分数和AUC-ROC曲线。使用UCI机器学习储存库中可用的数据集进行了实验,名为“早期阶段糖尿病风险预测”。与其他机器学习方法相比,结果证明了临时效果优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号