首页> 外文会议>International Conference on Industrial Informatics >Supervised and Unsupervised Learning for Fraud and Money Laundering Detection using Behavior Measuring Distance
【24h】

Supervised and Unsupervised Learning for Fraud and Money Laundering Detection using Behavior Measuring Distance

机译:使用行为测量距离监督和无监督的学习欺诈和洗钱检测

获取原文

摘要

Money laundering is the process of making large amounts of fund obtained from criminal activities appear to originate from a legitimate source. Fraud occurs when a person or business intentionally deceives another with promises of services or financial benefits that do not exist or were misrepresented. Fraud and Money laundering detections require to analyze abnormal behavioral patterns. To develop a detection model, we present a machine learning-based model which incorporates risk scoring and statistical clustering approaches. Given a customer represented by its values in a set of attributes, we define its Customer Behavior Score based on its percentile rank in each attribute, which measures the behavior of the customer against the median or “normal” customers in the group. The Customer Behavior Score induces a distance, called Behavior Measuring Distance, between any two customers. The k-medoids clustering technique based on the Behavior Measuring Distance is then applied iteratively to classify customers. The key features of the model are that the abnormality of customers' behaviors are measured based on their percentile ranks in their respective classes and that such measurement is dynamically updated based on the reclassification after each iteration during the training. Finally, the model is tested using the country risk data collected from public and internal sources, and the model outcomes are compared against a benchmark model. The experimental results show convergence and effectiveness of the model.
机译:洗钱是从犯罪活动获得的大量基金的过程似乎来自合法来源。当一个人或企业故意欺骗另一个人的服务或财政福利的承诺时,就会发生欺诈行为。欺诈和洗钱检测需要分析异常行为模式。要开发检测模型,我们介绍了一种基于机器学习的模型,它包含风险评分和统计聚类方法。给定由一组属性中的值表示的客户,我们根据每个属性中的百分位数定义其客户行为分数,该分数测量客户对组中位数或“正常”客户的行为。客户行为分数引起距离,称为行为测量距离,在任何两个客户之间。然后,基于行为测量距离的K-METOIDS聚类技术迭代地应用于对客户进行分类。该模型的关键特征是客户行为的异常基于它们各个类别中的百分位测量,并且这种测量基于在训练期间每次迭代之后的重新分类动态更新。最后,使用从公共和内部来源收集的国家风险数据进行测试,并将模型结果与基准模型进行比较。实验结果表明了模型的收敛性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号