首页> 外文会议>International Conference on Innovations in Bio-Inspired Computing and Applications >Revealing Abnormality Based on Hybrid Clustering and Classification Approach (RA-HC-CA)
【24h】

Revealing Abnormality Based on Hybrid Clustering and Classification Approach (RA-HC-CA)

机译:揭示基于混合聚类和分类方法的异常(RA-HC-CA)

获取原文

摘要

Abnormality Detection is the process of locating abnormal instances within the data. In this work, we have applied Abnormality Detection to the domain of detection associated with Credit Card Fraud. This problem is actually attributed to demonstrating those credit card transactions which have occurred in the earlier times, with the presence of awareness related to those instances, which are actually fraud ones. Applying this model, we can use it to predict if a new transaction is a fraud based or not. In this proposed work, we have utilized a combination framework of data mining clustering algorithms so as to solve the problem of credit card fraud detection to a particular extent. The proposed work Revealing Abnormality Based on Hybrid Clustering and Classification Approach (RA-HC-CA) consists of two stages namely a clustering phase followed by a detection phase. In the clustering phase, we have employed a combined clustering approach initiated by k-means clustering algorithm followed by hierarchical clustering algorithm. Prior to Hierarchical/Agglomerative clustering, the whole data set is clustered into meaningful 'k' knots by k-means clustering procedure. The output of 'k' groups is then inputted to Agglomerative clustering algorithm to merge the already obtained 'k' clusters from the previous phase, into more meaningful clusters. This is continued until 70-75% of data falls on one large group, which is the Normal group. The remaining data instances may converge in various other abnormal groups. The strong assumption made here is that such clusters with less instances, than a particular threshold are considered to be groups pertaining to fraud ones. Then, so as to check for the presence of an instance as fraud one, we initially identify the proximate gathering to which it fit into. Then, within that identified cluster, LDA (Linear Discriminant Analysis) is carried out. It has been observed that the proposed approach (RA-HC-CA) achieved 80.5% accuracy in compariso
机译:异常检测是在数据内定位异常情况的过程。在这项工作中,我们将异常检测应用于与信用卡欺诈相关的检测领域。这个问题实际上归因于展示在较早时间发生的信用卡交易,这是与那些实际情况有关的意识,这实际上是欺诈的。应用此模型,我们可以使用它来预测新事务是基于欺诈的。在这一拟议的工作中,我们利用了数据挖掘聚类算法的组合框架,以解决特定范围的信用卡欺诈检测问题。基于混合聚类和分类方法(RA-HC-CA)的所提出的工作揭示异常(RA-HC-CA)包括两个阶段,即聚类阶段,然后是检测阶段。在聚类阶段,我们已经采用了K-Means聚类算法发起的组合聚类方法,然后是分层聚类算法。在分层/凝聚聚类之前,通过K-means聚类程序将整个数据集聚集成有意义的'k'结。然后将'K'组的输出输入到附名聚类算法,以将已获得的'k'集群与前一阶段合并到更有意义的群集中。这是持续到70-75%的数据落在一个大型群体上,这是正常组。剩余的数据实例可以在各种其他异常组中收敛。这里所制定的强烈假设是这种具有较少情况的簇,而不是特定阈值被认为是与欺诈有关的群体。然后,以便检查是否存在实例作为欺诈之一,我们最初识别它适合的邻近收集。然后,在该识别的群集内,执行LDA(线性判别分析)。已经观察到所提出的方法(RA-HC-CA)在比较中实现了80.5%的准确性

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号