【24h】

Dataset Shift Quantification for Credit Card Fraud Detection

机译:用于信用卡欺诈检测的数据集移位量化

获取原文
获取原文并翻译 | 示例

摘要

Machine learning and data mining techniques have been used extensively in order to detect credit card frauds. However purchase behaviour and fraudster strategies may change over time. This phenomenon is named dataset shift [1] or concept drift in the domain of fraud detection [2]. In this paper, we present a method to quantify day-by-day the dataset shift in our face-to-face credit card transactions dataset (card holder located in the shop) . In practice, we classify the days against each other and measure the efficiency of the classification. The more efficient the classification, the more different the buying behaviour between two days, and vice versa. Therefore, we obtain a distance matrix characterizing the dataset shift. After an agglomerative clustering of the distance matrix, we observe that the dataset shift pattern matches the calendar events for this time period (holidays, week-ends, etc). We then incorporate this dataset shift knowledge in the credit card fraud detection task as a new feature. This leads to a small improvement of the detection.
机译:机器学习和数据挖掘技术已被广泛使用,以检测信用卡欺诈。但是,购买行为和欺诈策略可能会随着时间而改变。这种现象在欺诈检测领域被称为数据集移位[1]或概念漂移[2]。在本文中,我们提出了一种方法来量化面对面信用卡交易数据集(位于商店中的持卡人)中数据集的日常移动。在实践中,我们将天数彼此分类,并衡量分类的效率。分类越有效,两天之间的购买行为就越不同,反之亦然。因此,我们获得了一个表征数据集位移的距离矩阵。在距离矩阵的聚集聚类之后,我们观察到数据集移位模式与该时间段(节假日,周末等)的日历事件匹配。然后,我们将此数据集转移知识纳入信用卡欺诈检测任务中,作为一项新功能。这导致检测的微小改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号