首页> 外文会议>IEEE International Conference on Big Data and Smart Computing >Network-Based Feature Extraction Method for Fraud Detection via Label Propagation
【24h】

Network-Based Feature Extraction Method for Fraud Detection via Label Propagation

机译:基于网络的标签传播欺诈检测特征提取方法

获取原文

摘要

Using machine learning to detect fraud has become a new mainstream in the online lending industry. In the process of learning, feature engineering is a key step to determine the performance of the final model. Traditional feature engineering is mostly based on the intrinsic features of dataset, especially the combination of them. However, fraud has gradually become a group behavior. In a related network, nodes represent users, divided into fraudulent nodes and non-fraudulent nodes. Fraudulent nodes are sparsely related with non-fraudulent nodes, and the links between fraudulent nodes are closely related. This paper presents a method based on Label Propagation Algorithm to extract network features from the related network. Based on the related network containing of known fraudulent users and relationship between them, a personalized label propagation algorithm is used to infer the unknown user's fraud probability, and the fraud probability is regarded as a network-based derivative feature to increase the information entropy of the feature engineering. Additionally, we changed the initialization method of the transition probability matrix and label distribution matrix, to avoid the performance degradation of label propagation algorithm caused by the unbalanced distribution of fraud data. By testing in real datasets, 17% precision score of detecting fraudster was achieved using only network feature.
机译:使用机器学习检测欺诈已成为在线借贷行业的新主流。在学习过程中,要素工程是确定最终模型性能的关键步骤。传统的特征工程主要基于数据集的固有特征,尤其是它们的组合。但是,欺诈已逐渐成为一种群体行为。在相关网络中,节点代表用户,分为欺诈性节点和非欺诈性节点。欺诈性节点与非欺诈性节点稀疏相关,欺诈性节点之间的链接紧密相关。本文提出了一种基于标签传播算法的从相关网络中提取网络特征的方法。基于包含已知欺诈用户及其之间的关系的相关网络,使用个性化标签传播算法来推断未知用户的欺诈概率,并将欺诈概率视为基于网络的派生特征,以增加攻击者的信息熵。特征工程。此外,我们更改了转移概率矩阵和标签分配矩阵的初始化方法,以避免由于欺诈数据的不平衡分配而导致标签传播算法的性能下降。通过在真实数据集中进行测试,仅使用网络功能即可达到17%的欺诈者检测准确率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号