首页> 外文期刊>Journal of Soft Computing and Applications >Implementation of Clustering Algorithms for real datasets in Medical Diagnostics using MATLAB
【24h】

Implementation of Clustering Algorithms for real datasets in Medical Diagnostics using MATLAB

机译:使用MATLAB实现医学诊断中真实数据集的聚类算法

获取原文
       

摘要

As in the medical field, for one disease there require samples given by diagnosis. The samples will be analyzed by a doctor or a pharmacist. As the no. of patients increases their samples also increases, there require more time to analyze samples for deciding the stage of the disease. To analyze the sample every time requires a skilled person. The samples can be classified by applying them to clustering algorithms. Data clustering has been considered as the most important raw data analysis method used in data mining technology. Most of the clustering techniques proved their efficiency in many applications such as decision making systems, medical sciences, earth sciences etc. Partition based clustering is one of the main approach in clustering. There are various algorithms of data clustering, every algorithm has its own advantages and disadvantages. This work reports the results of classification performance of three such widely used algorithms namely K-means (KM), Fuzzy c-means and Fuzzy Possibilistic c-Means (FPCM) clustering algorithms. To analyze these algorithms three known data sets from UCI machine learning repository are taken such as thyroid data, liver and wine. The efficiency of clustering output is compared with the classification performance, percentage of correctness. The experimental results show that K-means and FCM give same performance for liver data. And FCM and FPCM are giving same performance for thyroid and wine data. FPCM has more efficient classification performance in all the given data sets.
机译:与医学领域一样,对于一种疾病,需要通过诊断给出样本。样品将由医生或药剂师进行分析。由于没有。的患者增加他们的样品也增加,需要更多时间分析样品以确定疾病的阶段。每次要分析样品都需要技术人员。通过将样本应用于聚类算法可以对样本进行分类。数据聚类被认为是数据挖掘技术中最重要的原始数据分析方法。大多数聚类技术证明了它们在决策系统,医学,地球科学等许多应用中的效率。基于分区的聚类是聚类的主要方法之一。数据聚类有多种算法,每种算法都有其自身的优缺点。这项工作报告了三种广泛使用的算法,即K-均值(KM),Fuzzy c-means和Fuzzy Possibilistic c-Means(FPCM)聚类算法的分类性能结果。为了分析这些算法,采用了来自UCI机器学习存储库的三个已知数据集,例如甲状腺数据,肝脏和葡萄酒。将聚类输出的效率与分类性能,正确性百分比进行比较。实验结果表明,K均值和FCM在肝数据方面具有相同的性能。 FCM和FPCM在甲状腺和葡萄酒数据方面具有相同的性能。在所有给定的数据集中,FPCM具有更有效的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号