首页> 中文期刊> 《计算机工程》 >基于优化ID3的井漏类型分类算法

基于优化ID3的井漏类型分类算法

         

摘要

决策树算法用于井漏分类时, 由于井漏数据离散化后多值属性占比较大, 且具有多值偏向的缺点, 分类效果不理想.为此, 提出一种基于改进ID3的AFIV-ID3算法.在ID3的基础上引入属性重要度计算新的信息熵, 属性重要度大小由决策者依靠先验或领域知识决定.在信息增益计算中加入关联度函数比, 对信息增益值做出修正.AFIV-ID3算法克服了ID3多值偏向的缺点, 提高了数据中重要属性的权重, 从而提升井漏类型分类精度.4组UCI数据集和真实井漏数据测试结果表明, 该算法的分类精度优于ID3和C4. 5算法, 并能够将人工经验法不稳定的分类精度提高至约72. 23%.%When the decision tree algorithm is used in well leakage classification, the classification effect is not satisfactory because of the large proportion of multi-valued attributes after the well leakage data is discretized, and because the algorithm has the shortcoming of multi-value bias. Therefore, an improved AFIV-ID3 algorithm based on ID3 is proposed. On the basis of ID3, attribute importance is introduced to calculate new information entropy. Attribute importance is determined by the decision maker depending on prior knowledge or domain knowledge. The association function ratio is added to the information gain calculation to modify the information gain value. The AFIV-ID3 algorithm overcomes the shortcoming of ID3 multi-value bias, improves the weight of important attributes in the data, and effectively improves the classification accuracy of well leakage type. The test results of four UCI data sets and real well leakage data show that the classification accuracy of this algorithm is better than that of ID3 and C4. 5 algorithm, and the unstable classification accuracy of artificial experience method can be improved to about 72. 23%.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号