An Improved XGBoost Model Based on Spark for Credit Card Fraud Prediction

Abstract

Credit card fraud causes huge economic losses for many financial institutions. Given the class imbalance and the large volume of data in credit card fraud detection, an improved XGBoost model based on Spark is proposed. The SMOTE algorithm is used to balance the training set, an XGBoost classifier running on Spark serves as the fraud detection mechanism, and the test sets are then classified in parallel. In the model comparison experiment, the proposed model is compared with logistic regression, decision tree, random forest, and original XGBoost models. The results show that the proposed model is the best on the three metrics of Recall, F1-Score, and AUC, leading the second-ranked model by 9.1%, 1.4%, and 1.2%, respectively. In the speedup experiment, the speedups on datasets of 70,000, 140,000, and 280,000 samples are 2.06, 3.28, and 3.75, respectively. Together, these two experiments show that the proposed model can predict credit card fraud accurately and efficiently and is effective in practice.
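
The balancing step named in the abstract can be illustrated with a minimal, pure-Python sketch of the general SMOTE idea: each synthetic minority sample is placed at a random point on the segment between a minority sample and one of its k nearest minority neighbours. This is not the authors' implementation; the function name `smote`, its parameters, and the toy data are assumptions made for illustration, and the Spark and XGBoost stages are not covered here.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Return n_synthetic points interpolated between minority samples.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of `base` (itself excluded).
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment base -> other, in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy data, made up for illustration: 2 features, 4 fraud rows vs 12 legitimate rows.
fraud = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
legit = [[float(i), float(-i)] for i in range(12)]

# Generate just enough synthetic fraud rows to match the majority class size.
new_fraud = smote(fraud, n_synthetic=len(legit) - len(fraud), k=3)
balanced_fraud = fraud + new_fraud
print(len(balanced_fraud), len(legit))
```

Only the training set is oversampled in this scheme, as the abstract states; the test set keeps its original class distribution so that Recall, F1-Score, and AUC reflect realistic conditions.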

Bibliographic record

Source
IEEE International Symposium on Smart and Wireless Systems; International Conferences on Intelligent Data Acquisition and Advanced Computing Systems | 2020 | pp. 1-6 | 6 pages
Authors
Hongwei Chen; He Ai; Zhihui Yang; Weiwei Yang; Zhiwei Ye; Dawei Dong
Original format: PDF
Keywords
Predictive models; Sparks; Prediction algorithms; Training; Data models; Credit cards; Mathematical model

Similar literature

1. Improved Sheep Flock Heredity Algorithm Based Prevention of Credit Card Fraud Detection for Online and Offline Transaction [J]. V. Mareeswari, G. Gunasekaran. International Journal of Computational Intelligence Research. 2015, No. 2
2. SCARFF: A Scalable Framework for Streaming Credit Card Fraud Detection with Spark [J]. Fabrizio Carcillo, Andrea Dal Pozzolo, Yann-Aël Le Borgne. Information Fusion. 2018
3. A Heterogeneous Ensemble Learning Model Based on Data Distribution for Credit Card Fraud Detection [J]. Yalong Xie, Aiping Li, Liqun Gao. Wireless Communications & Mobile Computing. 2021
4. Influence of Optimizing XGBoost to handle Class Imbalance in Credit Card Fraud Detection [C]. C. Victoria Priscilla, D. Padma Prabha. International Conference on Smart Systems and Inventive Technology. 2020
5. Improving Credit Card Fraud Detection using a Meta-Learning Strategy [D]. Pun, Joseph King-Fung. 2011
6. FraudMiner: A Novel Credit Card Fraud Detection Model Based on Frequent Itemset Mining [O]. K. R. Seeja, Masoumeh Zareapoor
7. SCARFF: a Scalable Framework for Streaming Credit Card Fraud Detection with Spark [O]. Carcillo, Fabrizio; Dal Pozzolo, Andrea; Le Borgne, Yann-Aël. 2017
