...
首页> 外文期刊>Expert systems with applications >Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models
【24h】

Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models

机译:使用基于广义加性模型的集成学习协调客户流失预测中的性能和可解释性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

To build a successful customer churn prediction model, a classification algorithm should be chosen that fulfills two requirements: strong classification performance and a high level of model interpretability. In recent literature, ensemble classifiers have demonstrated superior performance in a multitude of applications and data mining contests. However, due to an increased complexity they result in models that are often difficult to interpret. In this study, GAMensPlus, an ensemble classifier based upon generalized additive models (GAMs), in which both performance and interpretability are reconciled, is presented and evaluated in a context of chum prediction modeling. The recently proposed GAMens, based upon Bagging, the Random Subspace Method and semi-parametric GAMs as constituent classifiers, is extended to include two instruments for model interpretability: generalized feature importance scores, and bootstrap confidence bands for smoothing splines. In an experimental comparison on data sets of six real-life churn prediction projects, the competitive performance of the proposed algorithm over a set of well-known benchmark algorithms is demonstrated in terms of four evaluation metrics. Further, the ability of the technique to deliver valuable insight into the drivers of customer churn is illustrated in a case study on data from a European bank. Firstly, it is shown how the generalized feature importance scores allow the analyst to identify the relative importance of churn predictors in function of the criterion that is used to measure the quality of the model predictions. Secondly, the ability of GAMensPlus to identify nonlinear relationships between predictors and churn probabilities is demonstrated.
机译:要构建成功的客户流失预测模型,应选择一种满足以下两个要求的分类算法:强大的分类性能和较高的模型可解释性。在最近的文献中,集成分类器已在众多应用程序和数据挖掘竞赛中展示了卓越的性能。但是,由于复杂性增加,它们导致模型通常难以解释。在这项研究中,GAMensPlus是基于广义加性模型(GAM)的集成分类器,在该模型中,性能和可解释性都得到了协调,并在总体预测建模的背景下进行了评估。最近基于袋装,随机子空间法和半参数GAM作为构成分类器的GAMens被扩展为包括两种用于模型可解释性的工具:广义特征重要性评分和用于平滑样条曲线的自举置信带。在对六个真实客户流失预测项目的数据集进行的实验比较中,通过四个评估指标证明了该算法相对于一组知名基准算法的竞争性能。此外,在对一家欧洲银行的数据进行的案例研究中,展示了该技术能够为客户流失的驱动因素提供有价值的见解的能力。首先,它显示了广义特征重要性评分如何使分析人员根据用于测量模型预测质量的标准来识别流失预测变量的相对重要性。其次,证明了GAMensPlus能够识别预测变量和搅动概率之间的非线性关系的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号