首页> 外文期刊>Informatics in Medicine Unlocked >Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets
【24h】

Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets

机译:使用模糊集和极限学习机的混合方法对临床数据集进行分类

获取原文
           

摘要

Data mining techniques play a major role in developing computer aided diagnosis systems and expert systems that will aid a physician in clinical decision making. In this work, a classifier that combines the relative merits of fuzzy sets and extreme learning machine (FELM) for clinical datasets is proposed. The three major subsystems in the FELM framework are preprocessing subsystem, fuzzification subsystem and classification subsystem. Missing value imputation and outlier elimination are handled by the preprocessing subsystem. The fuzzification subsystem maps each feature to a fuzzy set and the classification subsystem uses extreme learning machine for classification. Cleveland heart disease (CHD), Statlog heart disease (SHD) and Pima Indian diabetes (PID) datasets from the University of California Irvine (UCI) machine learning repository have been used for experimentation. The CHD and SHD datasets have been experimented with two class labels one indicating the absence and the other indicating the presence of heart disease. The CHD dataset has also been experimented with five class labels, one class label indicating the absence of heart disease and the other four class labels indicating the severity of heart disease namely low risk, medium risk, high risk and serious. The PID data set has been experimented with two class labels one indicating the absence and the other indicating the presence of gestational diabetes. The classifier has achieved an accuracy of 93.55% for CHD data set with two class labels; 73.77% for CHD data set with five class labels; 94.44% for SHD data set and 92.54% for PID dataset. Highlights ? A classification approach using fuzzy logic and extreme learning machine is proposed. ? In data preprocessing, instances with outliers are eliminated from the dataset. ? Missing values are imputed by the most frequent value of the 5 nearest neighbors. ? Trapezoidal membership function is applied to transform the clinical dataset to linguistic variables. ? Extreme Learning Machine (ELM) is used for training the single layer feed forward neural network.
机译:数据挖掘技术在开发可帮助医师进行临床决策的计算机辅助诊断系统和专家系统中扮演着重要角色。在这项工作中,提出了一种分类器,该分类器结合了模糊集和极限学习机(FELM)的临床数据集的相对优点。 FELM框架中的三个主要子系统是预处理子系统,模糊化子系统和分类子系统。缺失值估算和异常值消除由预处理子系统处理。模糊化子系统将每个特征映射到一个模糊集,而分类子系统则使用极限学习机进行分类。来自加利福尼亚大学尔湾分校(UCI)机器学习存储库的克利夫兰心脏病(CHD),Statlog心脏病(SHD)和Pima印度糖尿病(PID)数据集已用于实验。 CHD和SHD数据集已使用两个类别标签进行了实验,一个类别标签指示不存在,另一个类别指示存在心脏病。还对CHD数据集进行了五类标签实验,一类标签指示没有心脏病,其他四类标签指示心脏病的严重程度,即低风险,中风险,高风险和严重。 PID数据集已使用两个类别标签进行了试验,一个类别标签指示不存在,另一个类别指示存在妊娠糖尿病。对于带有两个类别标签的CHD数据集,该分类器已达到93.55%的准确性;带有五个类别标签的冠心病数据集的73.77%; SHD数据集为94.44%,PID数据集为92.54%。强调 ?提出了一种基于模糊逻辑和极限学习机的分类方法。 ?在数据预处理中,从数据集中消除具有异常值的实例。 ?缺失值由5个最近邻居中最频繁的值来估算。 ?梯形隶属度函数用于将临床数据集转换为语言变量。 ?极限学习机(ELM)用于训练单层前馈神经网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号