Credit scoring models are the basis for financial institutions like retail and consumer credit banks. The purpose of the models is to evaluate the likelihood of credit applicants defaulting in order to decide whether to grant them credit. The area under the receiver operating characteristic (ROC) curve (AUC) is one of the most commonly used measures to evaluate predictive performance in credit scoring. The aim of this thesis is to benchmark different methods for building scoring models in order to maximize the AUC. While this measure is used to evaluate the predictive accuracy of the presented algorithms, the AUC is especially introduced as direct optimization criterion.ududThe logistic regression model is the most widely used method for creating credit scorecards and classifying applicants into risk classes. Since this development process, based on the logit model, is standard in the retail banking practice, the predictive accuracy of this proceeding is used for benchmark reasons throughout this thesis.ududThe AUC approach is a main task introduced within this work. Instead of using theudmaximum likelihood estimation, the AUC is considered as objective function to optimize it directly. The coefficients are estimated by calculating the AUC measure with Wilcoxon-Mann-Whitney and by using the Nelder-Mead algorithm for the optimization. The AUC optimization denotes a distribution-free approach, which is analyzed within a simulation study for investigating the theoretical considerations. It can be shown that the approach still works even if the underlying distribution is not logistic.ududIn addition to the AUC approach and classical well-known methods like generalized additive models, new methods from statistics and machine learning are evaluated for the credit scoring case. Conditional inference trees, model-based recursive partitioning methods and random forests are presented as recursive partitioning algorithms. Boosting algorithms are also explored by additionally using the AUC as a loss function.ududThe empirical evaluation is based on data from a German bank. From the application scoring, 26 attributes are included in the analysis. Besides the AUC, different performance measures are used for evaluating the predictive performance of scoring models. While classification trees cannot improve predictive accuracy for the current credit scoring case, the AUC approach and special boosting methods provide outperforming results compared to the robust classical scoring models regarding the predictive performance with the AUC measure.
展开▼
机译:信用评分模型是零售和消费者信用银行等金融机构的基础。该模型的目的是评估信贷申请人违约的可能性,以便决定是否授予他们信贷。接收者操作特征(ROC)曲线(AUC)下的面积是评估信用评分的预测性能的最常用方法之一。本文的目的是对建立评分模型的不同方法进行基准测试,以使AUC最大化。虽然此度量用于评估所提出算法的预测准确性,但特别引入了AUC作为直接优化标准。 ud ud对数回归模型是创建信用计分卡并将申请人分类为风险类别的最广泛使用的方法。由于这种基于logit模型的开发过程是零售银行业务实践的标准,因此在整个论文中,出于基准原因,使用此过程的预测准确性。 ud udAUC方法是这项工作中引入的主要任务。代替使用 udmaximum估计,AUC被认为是直接对其进行优化的目标函数。通过使用Wilcoxon-Mann-Whitney计算AUC度量并使用Nelder-Mead算法进行优化来估计系数。 AUC优化表示一种无分布方法,该方法在模拟研究中进行了分析,以研究理论上的考虑。可以证明,即使基础分布不是逻辑分布,该方法仍然有效。 ud ud除了AUC方法和经典的众所周知的方法(例如广义加性模型)之外,还对统计和机器学习中的新方法进行了信用评估计分案例。提出了条件推理树,基于模型的递归分区方法和随机森林作为递归分区算法。还通过将AUC用作损失函数来探索提升算法。 ud ud实证评估是基于德国一家银行的数据。根据应用程序评分,分析中包括26个属性。除了AUC之外,还使用其他性能指标来评估评分模型的预测性能。虽然分类树无法提高当前信用评分情况的预测准确性,但与健壮的经典评分模型相比,AUC度量的预测性能却优于AUC方法和特殊的提升方法。
展开▼