首页> 外文会议>International Conference on Information and Communications Technology >Improving accuracy of C4.5 algorithm using split feature reduction model and bagging ensemble for credit card risk prediction
【24h】

Improving accuracy of C4.5 algorithm using split feature reduction model and bagging ensemble for credit card risk prediction

机译:使用拆分特征约简模型和袋装集成提高C4.5算法的准确性以预测信用卡风险

获取原文

摘要

Giving credit to prospective debtor is determined by the existence of credit scoring. The accuracy of credit scoring to classify the debtor data is very important. The method that can be applied is classification and one of the classification method is decision tree. One of the decision tree algorithm that can be used is C4.5 algorithm. In this paper, the problem that discussed is how to increase the accuracy of C4.5 algorithm to predict credit receipts. The increasing accuracy is conducted by applying the Split Feature Reduction Model and Bagging Ensemble. Split Feature Reduction Model is applied in the preprocessing process which split datasets to the amount of n. In this paper, datasets split into 4 splits. Split 1 consists of 16 features, Split 2 consists of 12 features, Split 3 consists of 8 features, and Split 4 consists of 4 features. Then, C4.5 algorithm is applied to every splits. The best accuracy result by applying split feature reduction model with C4.5 algorithm is in Split 3 amount 73.1%. Then, the best accuracy results obtained by applying the split feature reduction model and bagging ensemble with C4.5 algorithm is in Split 3 amount 75.1%. In comparison to the accuracy of C4.5 algorithm stand alone, the applying of split feature reduction model and bagging ensemble obtained increased accuracy by 4.6%.
机译:向潜在债务人提供信用取决于信用评分的存在。信用评分对债务人数据进行分类的准确性非常重要。可以应用的方法是分类,分类方法之一是决策树。可以使用的决策树算法之一是C4.5算法。本文讨论的问题是如何提高C4.5算法预测信用收据的准确性。通过应用“拆分特征缩减模型”和“装袋合奏”可以提高准确性。分割特征约简模型在预处理过程中应用,将数据集分割为n个数量。在本文中,数据集分为4个分割。拆分1包含16个功能,拆分2包含12个功能,拆分3包含8个功能,拆分4包含4个功能。然后,将C4.5算法应用于每个分割。通过使用C4.5算法应用分割特征约简模型的最佳精度结果是分割3量的73.1%。然后,通过应用分割特征约简模型和C4.5算法的装袋集成获得的最佳精度结果为分割3量75.1%。与单独使用C4.5算法的准确性相比,使用分割特征约简模型和装袋集成可将准确性提高4.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号