首页> 外文期刊>Expert Systems with Application >The application of brute force logistic regression to corporate credit scoring models: Evidence from Serbian financial statements
【24h】

The application of brute force logistic regression to corporate credit scoring models: Evidence from Serbian financial statements

机译:蛮力逻辑回归在企业信用评分模型中的应用:来自塞尔维亚财务报表的证据

获取原文
获取原文并翻译 | 示例

摘要

In this paper a brute force logistic regression (LR) modeling approach is proposed and used to develop predictive credit scoring model for corporate entities. The modeling is based on 5 years of data from end-of-year financial statements of Serbian corporate entities, as well as, default event data. To the best of our knowledge, so far no relevant research about predictive power of financial ratios derived from Serbian financial statements has been published. This is also the first paper that generated 350 financial ratios to represent independent variables for 7590 corporate entities default predictions'. Many of derived financial ratios are new and were not discussed in literature before. Weight of evidence (WOE) method has been applied to transform and prepare financial ratios for brute force LR fitting simulations. Clustering method has been utilized to reduce long list of variables and to remove highly correlated financial ratios from partitioned training and validation datasets. The clustering results have revealed that number of variables can be reduced to short list of 24 financial ratios which are then analyzed in terms of default event predictive power. In this paper we propose the most predictive financial ratios from financial statements of Serbian corporate entities. The obtained short list of financial ratios has been used as a main input for brute force LR model simulations. According to literature, common practice to select variables in final model is to run stepwise, forward or backward LR. However, this research has been conducted in a way that the brute force LR simulations have to obtain all possible combinations of models that comprise of 5-14 independent variables from the short list of 24 financial ratios. The total number of simulated resulting LR models is around 14 million. Each model has been fitted through extensive and time consuming brute force LR simulations using SAS~® code written by the authors. The total number of 342,016 simulated models ("well-founded" models) has satisfied the established credit scoring model validity conditions. The well-founded models have been ranked according to GINI performance on validation dataset. After all well-founded models have been ranked, the model with highest predictive power and consisting of 8 financial ratios has been selected and analyzed in terms of receiver-operating characteristic curve (ROC), GINI, AIC, SC, LR fitting statistics and correlation coefficients. The financial ratio constituents of that model have been discussed and benchmarked with several models from relevant literature.
机译:本文提出了一种蛮力逻辑回归(LR)建模方法,并将其用于开发企业实体的预测信用评分模型。该建模基于塞尔维亚公司实体年终财务报表中的5年数据以及默认事件数据。据我们所知,到目前为止,尚未发表有关塞尔维亚财务报表中财务比率​​预测能力的相关研究。这也是第一篇产生350个财务比率代表7590个企业实体违约预测的自变量的论文。许多衍生的财务比率是新的,以前在文献中没有讨论过。证据权重(WOE)方法已应用于为蛮力LR拟合模拟转换和准备财务比率。聚类方法已被用于减少一长串变量,并从分区的训练和验证数据集中删除高度相关的财务比率。聚类结果表明,变量的数量可以减少到24个财务比率的简短列表,然后根据默认事件的预测能力进行分析。在本文中,我们提出了塞尔维亚公司实体财务报表中最具预测性的财务比率。获得的财务比率简短列表已用作蛮力LR模型仿真的主要输入。根据文献,在最终模型中选择变量的常规做法是逐步运行,向前或向后LR。但是,该研究的进行方式是,蛮力LR模拟必须从24个财务比率的简短列表中获得包含5-14个独立变量的模型的所有可能组合。模拟生成的LR模型的总数约为1400万。使用作者编写的SAS〜®代码通过广泛且耗时的蛮力LR模拟来拟合每个模型。总数为342,016个仿真模型(“良好基础”的模型)满足了已建立的信用评分模型的有效性条件。根据验证数据集上GINI的性能对良好模型进行排名。在对所有有根据的模型进行排名之后,选择并根据接收者工作特征曲线(ROC),GINI,AIC,SC,LR拟合统计和相关性对具有8种财务比率的具有最高预测能力的模型进行了分析。系数。该模型的财务比率构成部分已通过相关文献中的几种模型进行了讨论和基准测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号