首页> 外文会议>IEEE International Conference on e-Health Networking, Applications and Services >Missing value imputation methods for TCM medical data and its effect in the classifier accuracy
【24h】

Missing value imputation methods for TCM medical data and its effect in the classifier accuracy

机译:中医医学数据的缺失值插补方法及其对分类器准确性的影响

获取原文

摘要

Objective: Medical data mining is a research hotspot. But medical data often contains missing values, which brings difficulties to the medical data analysis. This work evaluates the performance of several imputation methods. Methods: In this paper, we first simulate the missing data set by completely deleting some data from the complete data set, and use the Euclidean distance KNN, the correlation coefficient KNN and the mean to fill several algorithms to estimate the exact data and compare the accuracy of different algorithm estimation. Then we use these filling algorithms to fill clinical data which has missing values and get complete data. Then we construct a predict model of patient disease by random forest algorithm and classification and regression trees algorithm. By comparing the observed values with the predicted values, we examined the effect of different filling algorithms on the prediction accuracy. Results: The accuracy of the three algorithms is compared under different missing rates. In the filling experiment, the performance of KNN based Pearson correlation coefficient is obviously better than KNN based Euclidean metric and mean imputation. And in the predict model, the performance of these three filling algorithms is the same as in the filling experiment. But the gap is not very significant.
机译:目的:医学数据挖掘是一个研究热点。但是,医学数据通常包含缺失值,这给医学数据分析带来了困难。这项工作评估了几种插补方法的性能。方法:在本文中,我们首先通过从完整数据集中完全删除一些数据来模拟缺失数据集,然后使用欧氏距离KNN,相关系数KNN和均值填充几种算法来估计准确数据并比较不同算法估计的准确性。然后,我们使用这些填充算法填充缺少值的临床数据并获得完整的数据。然后通过随机森林算法,分类回归树算法构建患者疾病的预测模型。通过将观测值与预测值进行比较,我们检查了不同填充算法对预测精度的影响。结果:比较了三种算法在不同丢失率下的准确性。在填充实验中,基于KNN的Pearson相关系数的性能明显优于基于KNN的欧氏度量和均值插补。并且在预测模型中,这三种填充算法的性能与填充实验中的相同。但是差距不是很大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号