首页> 外文会议>International Conference on Artificial Intelligence and Data Processing >TSCBAS: A Novel Correlation Based Attribute Selection Method and Application on Telecommunications Churn Analysis
【24h】

TSCBAS: A Novel Correlation Based Attribute Selection Method and Application on Telecommunications Churn Analysis

机译:TSCBAS:一种基于相关性的新属性选择方法及其在电信用户流失分析中的应用

获取原文

摘要

Attribute selection has a significant effect on the performance of the machine learning studies by selecting the attributes having significant effect on result, reducing the number of attributes, and reducing the calculation cost. In this study, a new attribute selection method which is a combination of the R-correlation coefficient-based attribute selection (RCBAS) and the ρ-correlation coefficient-based attribute selection (ρCBAS) called the Two-Stage Correlation-Based Attribute Selection (TSCBAS) is proposed to select significant attributes. The proposed attribute selection method has been applied to customer churn prediction on a telecommunications dataset for performance evaluation. The dataset used in the study includes real customer call records details for the years 2013 and 2014 obtained from a major telecommunications company in Turkey. Apart from the proposed attribute selection method, four different methods named Rcorrelation coefficient-based attribute selection, ρ-correlation coefficient-based attribute selection, ReliefF, and Gain Ratio have been used for creating five datasets. After that, four classifier algorithms including Random Forest, C4.5 Decision Tree, Naive Bayes and AdaBoost.M1 have been applied. The obtained results have been compared according to the performance metrics comprising Accuracy (ACC), Sensitivity (TPR), Specificity (SPC), F-measure (F), AUC (area under the ROC curve), and run-time. The results of the comparisons show that the proposed attribute selection algorithm outperforms the state of the art methods on customer churn prediction.
机译:通过选择对结果有重大影响的属性,减少属性的数量并减少计算成本,属性选择对机器学习研究的性能具有重大影响。在这项研究中,一种基于R相关系数的属性选择(RCBAS)和基于ρ相关系数的属性选择(ρCBAS)相结合的新属性选择方法称为两阶段基于相关的属性选择(建议使用TSCBAS来选择重要属性。所提出的属性选择方法已应用于电信数据集的客户流失预测以进行性能评估。该研究中使用的数据集包括从土耳其一家主要电信公司获得的2013年和2014年的真实客户呼叫记录详细信息。除了提出的属性选择方法外,还使用了四种不同的方法来创建五个数据集,这些方法分别是基于相关系数的属性选择,基于ρ相关系数的属性选择,ReliefF和增益比。之后,应用了四种分类器算法,包括随机森林,C4.5决策树,朴素贝叶斯和AdaBoost.M1。根据性能指标对获得的结果进行了比较,这些指标包括准确性(ACC),灵敏度(TPR),特异性(SPC),F量度(F),AUC(ROC曲线下的面积)和运行时间。比较结果表明,所提出的属性选择算法在客户流失预测方面优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号