首页> 外文会议>International conference on information systems security >WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection
【24h】

WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection

机译:WiP:在信用卡欺诈检测中对数据进行过采样的生成对抗网络

获取原文

摘要

In this digital world, numerous credit card-based transactions take place all over the world. Concomitantly, gaps in process flows and technology result in many fraudulent transactions. Owing to the spurt in the number of reported fraudulent transactions, customers and credit card service providers incur significant financial and reputation losses respectively. Therefore, building a powerful fraud detection system is paramount. It is noteworthy that fraud detection datasets, by nature, are highly unbalanced. Consequently, almost all of the supervised classifiers, when built on the unbalanced datasets, yield high false negative rates. But, the extant oversampling methods while reducing the false negatives, increase the false positives. In this paper, we propose a novel data oversampling method using Generative Adversarial Network (GAN). We use GAN and its variant to generate synthetic data of fraudulent transactions. To evaluate the effectiveness of the proposed method, we employ machine learning classifiers on the data balanced by GAN. Our proposed GAN-based oversampling method simultaneously achieved high precision, F1-score and dramatic reduction in the count of false positives compared to the state-of-the-art synthetic data generation based oversampling methods such as Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN) and random oversampling. Moreover, an ablation study involving the over-sampling based on the ensemble of SMOTE and GAN/WGAN generated datasets indicated that it is outperformed by the proposed methods in terms of F1 score and false positive count.
机译:在这个数字世界中,全世界发生了许多基于信用卡的交易。随之而来的是,流程和技术之间的差距导致了许多欺诈性交易。由于报告的欺诈交易数量激增,客户和信用卡服务提供商分别蒙受了重大的财务和声誉损失。因此,构建功能强大的欺诈检测系统至关重要。值得注意的是,欺诈检测数据集本质上是高度不平衡的。因此,当建立在不平衡的数据集上时,几乎所有的监督分类器都会产生很高的假阴性率。但是,现有的过采样方法在减少误报的同时会增加误报。在本文中,我们提出了一种使用生成对抗网络(GAN)的新型数据过采样方法。我们使用GAN及其变体生成欺诈性交易的综合数据。为了评估该方法的有效性,我们对GAN平衡的数据采用了机器学习分类器。与最新的基于综合数据生成的过采样方法(例如,综合少数群体过采样技术(SMOTE))相比,我们提出的基于GAN的过采样方法可同时实现高精度,F1得分和假阳性计数的显着减少。自适应合成采样(ADASYN)和随机过采样。此外,一项涉及基于SMOTE和GAN / WGAN生成的数据集的过采样的消融研究表明,就F1得分和假阳性计数而言,该方法优于所提出的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号