...
首页> 外文期刊>Journal of Business Research >Customer churn prediction in telecommunication industry using data certainty
【24h】

Customer churn prediction in telecommunication industry using data certainty

机译:使用数据确定性的电信行业客户流失预测

获取原文
获取原文并翻译 | 示例
           

摘要

Customer Churn Prediction (CCP) is a challenging activity for decision makers and machine learning community because most of the time, churn and non-churn customers have resembling features. From different experiments on customer churn and related data, it can be seen that a classifier shows different accuracy levels for different zones of a dataset. In such situations, a correlation can easily be observed in the level of classifier's accuracy and certainty of its prediction. If a mechanism can be defined to estimate the classifier's certainty for different zones within the data, then the expected classifier's accuracy can be estimated even before the classification. In this paper, a novel CCP approach is presented based on the above concept of classifier's certainty estimation using distance factor. The dataset is grouped into different zones based on the distance factor which are then divided into two categories as; (i) data with high certainty, and (ii) data with low certainty, for predicting customers exhibiting Churn and Non-churn behavior. Using different state-of-the-art evaluation measures (e.g., accuracy, f-measure, precision and recall) on different publicly available the Telecommunication Industry (TCI) datasets show that (i) the distance factor is strongly co-related with the certainty of the classifier, and (ii) the classifier obtained high accuracy in the zone with greater distance factor's value (i.e., customer churn and non-churn with high certainty) than those placed in the zone with smaller distance factor's value (i.e., customer chum and non-churn with low certainty).
机译:对于决策者和机器学习社区而言,客户流失预测(CCP)是一项具有挑战性的活动,因为在大多数情况下,客户流失和非客户流失具有相似的功能。通过对客户流失率和相关数据的不同实验,可以看出分类器针对数据集的不同区域显示了不同的准确性级别。在这种情况下,可以容易地观察到分类器准确性和预测确定性之间的相关性。如果可以定义一种机制来估计数据中不同区域的分类器确定性,那么甚至可以在分类之前就估计预期分类器的准确性。本文基于上述基于距离因子的分类器确定性估计概念,提出了一种新颖的CCP方法。根据距离因子将数据集分为不同的区域,然后将其分为两类: (i)具有较高确定性的数据,以及(ii)具有较低确定性的数据,用于预测表现出流失和非流失行为的客户。对不同的公开可用的不同的最新评估方法(例如准确性,f度量,准确性和召回率),电信行业(TCI)数据集显示(i)距离因子与分类器的确定性;(ii)分类器在距离因子值较大的区域(即,客户流失率和非确定性高确定性)中,比放置在距离因子值较小的区域(即,客户流失率)高不确定的混音和非混音)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号