首页> 外文会议>International Conference on Information and Communications Technology >Improving Accuracy of C4.5 Algorithm Using Split Feature Reduction Model and Bagging Ensemble for Credit Card Risk Prediction
【24h】

Improving Accuracy of C4.5 Algorithm Using Split Feature Reduction Model and Bagging Ensemble for Credit Card Risk Prediction

机译:使用分流特征减少模型和袋装集合的C4.5算法精度提高C4.5算法进行信用卡风险预测

获取原文

摘要

Giving credit to prospective debtor is determined by the existence of credit scoring. The accuracy of credit scoring to classify the debtor data is very important. The method that can be applied is classification and one of the classification method is decision tree. One of the decision tree algorithm that can be used is C4.5 algorithm. In this paper, the problem that discussed is how to increase the accuracy of C4.5 algorithm to predict credit receipts. The increasing accuracy is conducted by applying the Split Feature Reduction Model and Bagging Ensemble. Split Feature Reduction Model is applied in the preprocessing process which split datasets to the amount of n. In this paper, datasets split into 4 splits. Split 1 consists of 16 features, Split 2 consists of 12 features, Split 3 consists of 8 features, and Split 4 consists of 4 features. Then, C4.5 algorithm is applied to every splits. The best accuracy result by applying split feature reduction model with C4.5 algorithm is in Split 3 amount 73.1%. Then, the best accuracy results obtained by applying the split feature reduction model and bagging ensemble with C4.5 algorithm is in Split 3 amount 75.1%. In comparison to the accuracy of C4.5 algorithm stand alone, the applying of split feature reduction model and bagging ensemble obtained increased accuracy by 4.6%.
机译:向预期债务人提供信贷是由信用评分的存在决定的。信用评分的准确性来分类债务人数据非常重要。可以应用的方法是分类,其中一个分类方法是决策树。可以使用的决策树算法之一是C4.5算法。在本文中,讨论的问题是如何提高C4.5算法的准确性来预测信用收据。通过应用分离特征减少模型和装袋集合来进行增加的准确性。拆分特征减少模型应用于预处理过程,将数据集分为n的数量。在本文中,数据集分成4个分裂。拆分1由16个功能组成,拆分2由12个功能组成,拆分3由8个功能组成,拆分4由4个功能组成。然后,将C4.5算法应用于每个分割。通过使用C4.5算法应用分割特征减少模型的最佳精度结果是分割3份73.1%。然后,通过使用C4.5算法应用分割特征减少模型和袋装集合获得的最佳精度结果是分配3份75.1%。与单独的C4.5算法的准确性相比,分裂特征减少模型和袋装集合的应用提高了4.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号