首页> 外文会议>International Conference on Communication Systems and Network Technologies >A Data-Driven Approach to Predict Default Risk of Loan for Online Peer-to-Peer (P2P) Lending
【24h】

A Data-Driven Approach to Predict Default Risk of Loan for Online Peer-to-Peer (P2P) Lending

机译:一种数据驱动的方法,以预测在线点对点(P2P)贷款的默认贷款风险

获取原文

摘要

Online Peer-to-Peer (P2P) lending has achieved explosive development recently, which could be beneficial to both sides of individual lending. In this study, a data mining (DM) approach to predict the performance of P2P loan before funded is proposed. Using data from the Lending Club, we explore the characteristics of loan and its applicant and use random forest to do the feature selection in the modeling phase. The Difference from other risk prediction models is that the prediction is classified into three or four categories, rather than just two the default and not default classes. Then we compare five DM models: two decision trees (DTs), two neural networks (NNs) and one support vector machine (SVM) and use two metrics: average percent hit rate and area of the lift cumulative curve to evaluate the prediction results. The Empirical result shows that the term of loan, annual income, the amount of loan, debt-to-income ratio, credit grade and revolving line utilization play an important role in loan defaults. And SVM, Classification and Regression Tree (CART) and Multi-layer perceptron (MPL)'s prediction performance are almost equal.
机译:在线点对点(P2P)贷款最近取得了爆炸性发展,这可能对个人贷款的双方有利。在本研究中,提出了一种数据挖掘(DM)方法来预测在资助之前的P2P贷款的表现。使用来自贷款俱乐部的数据,我们探讨了贷款及其申请人的特点,并使用随机森林在建模阶段进行功能选择。与其他风险预测模型的差异是将预测分为三个或四个类别,而不是仅为默认等级而不是默认类。然后我们比较五个DM型号:两个决策树(DTS),两个神经网络(NNS)和一个支持向量机(SVM)和使用两个度量:平均百分比命中率和电梯累积曲线的面积来评估预测结果。经验结果表明,贷款期限,年收入,贷款金额,收入比率,信贷等级和循环线利用率在贷款违约方面发挥着重要作用。和SVM,分类和回归树(推车)和多层的Perceptron(MPL)的预测性能几乎相等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号