WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection

机译：WiP：在信用卡欺诈检测中对数据进行过采样的生成对抗网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this digital world, numerous credit card-based transactions take place all over the world. Concomitantly, gaps in process flows and technology result in many fraudulent transactions. Owing to the spurt in the number of reported fraudulent transactions, customers and credit card service providers incur significant financial and reputation losses respectively. Therefore, building a powerful fraud detection system is paramount. It is noteworthy that fraud detection datasets, by nature, are highly unbalanced. Consequently, almost all of the supervised classifiers, when built on the unbalanced datasets, yield high false negative rates. But, the extant oversampling methods while reducing the false negatives, increase the false positives. In this paper, we propose a novel data oversampling method using Generative Adversarial Network (GAN). We use GAN and its variant to generate synthetic data of fraudulent transactions. To evaluate the effectiveness of the proposed method, we employ machine learning classifiers on the data balanced by GAN. Our proposed GAN-based oversampling method simultaneously achieved high precision, F1-score and dramatic reduction in the count of false positives compared to the state-of-the-art synthetic data generation based oversampling methods such as Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN) and random oversampling. Moreover, an ablation study involving the over-sampling based on the ensemble of SMOTE and GAN/WGAN generated datasets indicated that it is outperformed by the proposed methods in terms of F1 score and false positive count.

机译：在这个数字世界中，全世界发生了许多基于信用卡的交易。随之而来的是，流程和技术之间的差距导致了许多欺诈性交易。由于报告的欺诈交易数量激增，客户和信用卡服务提供商分别蒙受了重大的财务和声誉损失。因此，构建功能强大的欺诈检测系统至关重要。值得注意的是，欺诈检测数据集本质上是高度不平衡的。因此，当建立在不平衡的数据集上时，几乎所有的监督分类器都会产生很高的假阴性率。但是，现有的过采样方法在减少误报的同时会增加误报。在本文中，我们提出了一种使用生成对抗网络（GAN）的新型数据过采样方法。我们使用GAN及其变体生成欺诈性交易的综合数据。为了评估该方法的有效性，我们对GAN平衡的数据采用了机器学习分类器。与最新的基于综合数据生成的过采样方法（例如，综合少数群体过采样技术（SMOTE））相比，我们提出的基于GAN的过采样方法可同时实现高精度，F1得分和假阳性计数的显着减少。自适应合成采样（ADASYN）和随机过采样。此外，一项涉及基于SMOTE和GAN / WGAN生成的数据集的过采样的消融研究表明，就F1得分和假阳性计数而言，该方法优于所提出的方法。

著录项

来源
《International conference on information systems security》|2019年|123-134|共12页
会议地点
作者
Akhilesh Kumar Gangwar; Vadlamani Ravi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Fraud detection; Supervised classification; Deep learning; Generative Adversarial Network; Oversampling; SMOTE;

机译：欺诈识别;监督分类;深度学习;生成对抗网络;过度采样;冒烟;

相似文献

外文文献
中文文献
专利

1. Using generative adversarial networks for improving classification effectiveness in credit card fraud detection [J] . Fiore Ugo, De Santis Alfredo, Perla Francesca, Information Sciences: An International Journal . 2019,第期

机译：利用生成的对抗网络来提高信用卡欺诈检测中的分类效果
2. Oversampling Log Messages using A Sequence Generative Adversarial Network for Anomaly Detection and Classification [J] . Amir Farzad, T. Aaron Gulliver Computer Science & Information Technology . 2020,第5期

机译：使用序列生成的对冲网络用于异常检测和分类的过采样日志消息
3. Medical insurance fraud detection based on generative adversarial networks [J] . Du Y. N., Yang Q. Basic & clinical pharmacology & toxicology. . 2019,第S8期

机译：基于生成对抗网络的医疗保险欺诈检测
4. WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection [C] . Akhilesh Kumar Gangwar, Vadlamani Ravi International Conference on Information Systems Security . 2019

机译：WIP：用于信用卡欺诈检测中的过采样数据的生成对抗网络
5. Augmenting Electrocardiogram Datasets Using Generative Adversarial Networks [D] . Alladi, Santhosh. 2020

机译：使用生成的对抗网络增强心电图数据集
6. Synthetic minority oversampling of vital statistics data with generative adversarial networks [O] . Aki Koivu, Mikko Sairanen, Antti Airola, 2020

机译：具有生成对抗网络的重要统计数据的合成少数民族空缺
7. A SMOTe based Oversampling Data-Point Approach to Solving the Credit Card Data Imbalance Problem in Financial Fraud Detection [O] . Nhlakanipho Mqadi, Nalindren Naicker, Timothy Adeliyi 2021

机译：基于席卷的过采样数据点方法来解决金融欺诈检测中的信用卡数据不平衡问题

WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection

摘要

著录项

相似文献

相关主题

期刊订阅