首页> 外文会议>International conference on intelligent data engineering and automated learning >An Improved Outlier Detection Algorithm to Medical Insurance
【24h】

An Improved Outlier Detection Algorithm to Medical Insurance

机译:一种改进的医疗保险异常值检测算法

获取原文
获取外文期刊封面目录资料

摘要

With the development of the medical insurance industry in China, medical insurance data with complex, multidimensional and interdisciplinary feature are extremely increasing. How to mine the potential value from the vast amounts of data and improve the efficiency of data analysis are topical issues in the study of data mining. This paper presents an improved LOF Outlier Detection Algorithm - GdiLOF, an algorithm which reduces dataset by removing the normal data and introduces information entropy to improve the accuracy of the LOF algorithm. Platform adaptability is analyzed by running it on Hadoop platform. The experimental results show that GdiLOF algorithm has high efficiency and the accuracy is 6 percentage points higher than LOF algorithm. And it also run better in the Hadoop distributed platforms, as well as having obvious advantages in processing huge amounts of data.
机译:随着中国医疗保险业的发展,具有复杂,多维,跨学科特征的医疗保险数据正在急剧增加。如何从海量数据中挖掘潜在价值并提高数据分析效率是数据挖掘研究的主题。本文提出了一种改进的LOF离群值检测算法-GdiLOF,该算法通过去除常规数据来减少数据集,并引入信息熵来提高LOF算法的准确性。通过在Hadoop平台上运行来分析平台的适应性。实验结果表明,GdiLOF算法效率高,精度比LOF算法高6个百分点。而且它在Hadoop分布式平台上也运行得更好,并且在处理大量数据方面具有明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号