首页> 外文会议>International Symposium on Multidisciplinary Studies and Innovative Technologies >Classification of Diabetes Dataset with Data Mining Techniques by Using WEKA Approach
【24h】

Classification of Diabetes Dataset with Data Mining Techniques by Using WEKA Approach

机译:基于WEKA方法的数据挖掘技术对糖尿病数据集的分类

获取原文

摘要

Diabetes is a very common disease and beside it causes serious health problems such as fatal kidney damage or blindness; it may lead the patient to death. There is no exact cure for this disease yet but it is manageable with medication and diet. In this manner, importance of correct diagnosis of diabetes is very important to identify the diseases in early stage and take necessary precautions. There is a lot of data accumulated on this subject, as there are so many patients with this condition. This makes it possible for researchers to use data mining techniques on this subject. This study is proposed to classify diabetes by using data mining techniques. The dataset which has been obtained from UCI machine learning depository contains 520 instances, each having 17 attributes. Seven different classification algorithm including Bayes Network, Naïve Bayes, J48, Random Tree, Random Forest, k-NN and SVM have been studied on this dataset. Obtained results indicated that k-NN performed the highest accuracy with 98.07% and this algorithm is the best method to identify and classify diabetes diseases on studies dataset.
机译:糖尿病是一种非常常见的疾病,此外还会引起严重的健康问题,例如致命的肾脏损害或失明。可能导致患者死亡。目前尚不能确切治愈该病,但可以通过药物和饮食来控制。以这种方式,正确诊断糖尿病的重要性对于早期发现疾病并采取必要的预防措施非常重要。由于有这么多患者患有此病,因此有大量关于此主题的数据积累。这使得研究人员可以在此主题上使用数据挖掘技术。提出这项研究是通过使用数据挖掘技术对糖尿病进行分类。从UCI机器学习存储库获得的数据集包含520个实例,每个实例具有17个属性。在该数据集上研究了七种不同的分类算法,包括贝叶斯网络,朴素贝叶斯,J48,随机树,随机森林,k-NN和SVM。所得结果表明,k-NN的准确率最高,为98.07%,该算法是在研究数据集上识别和分类糖尿病疾病的最佳方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号