【24h】

Analysis of Diabetes Dataset

机译:糖尿病数据集分析

获取原文

摘要

The focus of the research study was analysis of diabetes dataset and how it will perform if we try to do a prediction of diabetes with different machine learning algorithms. We used the original dataset from the National Institute of Diabetes, and Digestive and Kidney Diseases. The dataset can be used to predict whether or not a patient has diabetes, based on certain diagnostics. For analysis we used Amazon Web Services. We used AWS S3 service to store our dataset, and Amazon Sagemaker to perform an analysis. For the given dataset we applied three classification models: Logistic Regression Model, K-nearest Neighbors and Support Vector Machines. For each of the models we also performed a performance measurement. We also compared all the results we got and according to the results, Support Vector Machines has the best performance. Insights and recommendations are provided.
机译:研究研究的重点是糖尿病数据集的分析以及如果我们尝试使用不同的机器学习算法预测糖尿病的预测。我们使用来自国家糖尿病研究所和消化和肾病的原始数据集。数据集可用于预测患者是否基于某些诊断患者是否具有糖尿病。对于分析,我们使用了亚马逊Web服务。我们使用AWS S3服务来存储我们的数据集和Amazon Sagemaker来执行分析。对于给定的数据集,我们应用了三种分类模型:Logistic回归模型,K-Etcleit邻居和支持向量机。对于每个模型,我们还执行了性能测量。我们还比较了我们得到的所有结果,并根据结果,支持向量机具有最佳性能。提供了见解和建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号