首页> 外文OA文献 >Recent methods from statistics and machine learning for credit scoring
【2h】

Recent methods from statistics and machine learning for credit scoring

机译:最近的统计和机器学习方法用于信用评分

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Credit scoring models are the basis for financial institutions like retail and consumer credit banks. The purpose of the models is to evaluate the likelihood of credit applicants defaulting in order to decide whether to grant them credit. The area under the receiver operating characteristic (ROC) curve (AUC) is one of the most commonly used measures to evaluate predictive performance in credit scoring. The aim of this thesis is to benchmark different methods for building scoring models in order to maximize the AUC. While this measure is used to evaluate the predictive accuracy of the presented algorithms, the AUC is especially introduced as direct optimization criterion.ududThe logistic regression model is the most widely used method for creating credit scorecards and classifying applicants into risk classes. Since this development process, based on the logit model, is standard in the retail banking practice, the predictive accuracy of this proceeding is used for benchmark reasons throughout this thesis.ududThe AUC approach is a main task introduced within this work. Instead of using theudmaximum likelihood estimation, the AUC is considered as objective function to optimize it directly. The coefficients are estimated by calculating the AUC measure with Wilcoxon-Mann-Whitney and by using the Nelder-Mead algorithm for the optimization. The AUC optimization denotes a distribution-free approach, which is analyzed within a simulation study for investigating the theoretical considerations. It can be shown that the approach still works even if the underlying distribution is not logistic.ududIn addition to the AUC approach and classical well-known methods like generalized additive models, new methods from statistics and machine learning are evaluated for the credit scoring case. Conditional inference trees, model-based recursive partitioning methods and random forests are presented as recursive partitioning algorithms. Boosting algorithms are also explored by additionally using the AUC as a loss function.ududThe empirical evaluation is based on data from a German bank. From the application scoring, 26 attributes are included in the analysis. Besides the AUC, different performance measures are used for evaluating the predictive performance of scoring models. While classification trees cannot improve predictive accuracy for the current credit scoring case, the AUC approach and special boosting methods provide outperforming results compared to the robust classical scoring models regarding the predictive performance with the AUC measure.
机译:信用评分模型是零售和消费者信用银行等金融机构的基础。该模型的目的是评估信贷申请人违约的可能性,以便决定是否授予他们信贷。接收者操作特征(ROC)曲线(AUC)下的面积是评估信用评分的预测性能的最常用方法之一。本文的目的是对建立评分模型的不同方法进行基准测试,以使AUC最大化。虽然此度量用于评估所提出算法的预测准确性,但特别引入了AUC作为直接优化标准。 ud ud对数回归模型是创建信用计分卡并将申请人分类为风险类别的最广泛使用的方法。由于这种基于logit模型的开发过程是零售银行业务实践的标准,因此在整个论文中,出于基准原因,使用此过程的预测准确性。 ud udAUC方法是这项工作中引入的主要任务。代替使用 udmaximum估计,AUC被认为是直接对其进行优化的目标函数。通过使用Wilcoxon-Mann-Whitney计算AUC度量并使用Nelder-Mead算法进行优化来估计系数。 AUC优化表示一种无分布方法,该方法在模拟研究中进行了分析,以研究理论上的考虑。可以证明,即使基础分布不是逻辑分布,该方法仍然有效。 ud ud除了AUC方法和经典的众所周知的方法(例如广义加性模型)之外,还对统计和机器学习中的新方法进行了信用评估计分案例。提出了条件推理树,基于模型的递归分区方法和随机森林作为递归分区算法。还通过将AUC用作损失函数来探索提升算法。 ud ud实证评估是基于德国一家银行的数据。根据应用程序评分,分析中包括26个属性。除了AUC之外,还使用其他性能指标来评估评分模型的预测性能。虽然分类树无法提高当前信用评分情况的预测准确性,但与健壮的经典评分模型相比,AUC度量的预测性能却优于AUC方法和特殊的提升方法。

著录项

  • 作者

    Kraus Anne;

  • 作者单位
  • 年度 2014
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号