首页> 外文OA文献 >Financial fraud detection by using grammar-based multiobjective genetic programming with ensemble learning
【2h】

Financial fraud detection by using grammar-based multiobjective genetic programming with ensemble learning

机译:使用基于语法的多目标遗传规划和集成学习进行财务欺诈检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Financial fraud is a criminal act, which violates the law, rules or policy to gain unauthorized financial benefit. As an increasingly serious problem, it has attracted a lot of concerns. The major consequences are loss of billions of dollars each year, investor confidence and corporate reputation. Therefore, a study area called Financial Fraud Detection (FFD) is obligatory, in order to prevent the destructive results caused by financial fraud. In general, traditional modeling approaches are applied and based on pre-defined hypothesis testing of causes and effects for FFD problems. In addition, the evaluation criteria are often based on variable significance level or Goodness-of-fit only.FFD has many common features like other data mining problems. It has accumulated vast amounts of data records of different forms (e.g. financial statements or annual reports) over a period of time. It is very difficult to observe the interesting information just by relying on traditional statistical methods. However, data mining techniques can be used to extract implicit, previously unknown and potentially useful patterns, rules or relations from massive data repositories. Such discovered patterns are appropriate to executive leadership, stakeholders and related regulatory agencies to reduce or avoid the losses.As real-life problems, it is not sufficient for FFD to consider only a single criterion (e.g. Goodness-of-fit or accuracy). Instead, FFD can also seek multiple objectives (e.g. accuracy versus interestingness). It is not easy to consider multiple objectives at the same time unless applying combination methods (e.g. linear combination) by assigning different weights to present the importance for each criterion by using data mining techniques with a single evaluation criterion. For example, accuracy is more important than interestingness with weights of 0.9:0.1. But it is still difficult to decide the appropriate or exact values for weights. There-fore, multi-objective data-mining techniques are required to tackle FFD problems.In this study, FFD is targeted, and comprehensively evaluated by a number of methods. The proposed method is based on Grammar-Based Genetic Programming (GBGP), which has been proven to be a powerful data mining technique to generate compact and straightforward results. The major contributions are three improvements of GBGP for FFD problems. First, multi-criteria are considered by integrating the concept of multi-objectives into GBGP. Second, minority prediction is applied to demonstrate the class prediction with unmatched rows in their rules. Lastly, a new meta-heuristic approach is introduced for ensemble learning in order to help users to select patterns from a pool of models to facilitate final decision-making. The experimental results showed the effectiveness of the new approach in four FFD problems including two real-life problems. The major implications and significances of the study can concretely generalize for three points. First, it suggests a new ensemble learning technique with GBGP. Second, it demonstrates the usability of classification rules generated by the proposed method. Third, it provides an efficient multi-objective method for solving FFD problems.
机译:金融欺诈是一种犯罪行为,违反法律,法规或政策以获取未经授权的经济利益。作为一个日益严重的问题,它引起了很多关注。主要后果是每年损失数十亿美元,投资者信心和企业声誉。因此,为了防止财务欺诈造成的破坏性结果,必须设立一个称为财务欺诈检测(FFD)的研究领域。通常,使用传统的建模方法并基于预定义的假设检验来检验FFD问题的原因和结果。此外,评估标准通常仅基于可变的显着性水平或拟合优度。FFD具有许多共同的特征,如其他数据挖掘问题。它在一段时间内已积累了大量不同形式的数据记录(例如财务报表或年度报告)。仅依靠传统的统计方法很难观察到有趣的信息。但是,数据挖掘技术可以用于从海量数据存储库中提取隐式,以前未知且潜在有用的模式,规则或关系。这种发现的模式适合执行领导,利益相关者和相关监管机构减少或避免损失。作为现实生活中的问题,FFD仅考虑一个标准(例如拟合优度或准确性)是不够的。相反,FFD也可以寻求多个目标(例如,准确性与趣味性)。除非同时使用组合方法(例如线性组合)通过使用具有单个评估标准的数据挖掘技术分配不同的权重来表示每个标准的重要性,否则很难同时考虑多个目标。例如,权重为0.9:0.1时,准确性比趣味性更重要。但是,仍然难以确定适当或精确的权重值。因此,需要多目标数据挖掘技术来解决FFD问题。在这项研究中,FFD是有针对性的,并通过多种方法进行了综合评估。所提出的方法基于基于语法的遗传编程(GBGP),已被证明是一种强大的数据挖掘技术,可以生成紧凑而直接的结果。主要贡献是针对FFD问题的GBGP的三个改进。首先,通过将多目标概念集成到GBGP中来考虑多标准。其次,应用少数派预测来证明类别预测中规则不匹配的类。最后,为集成学习引入了一种新的元启发式方法,以帮助用户从模型库中选择模式,以促进最终决策。实验结果表明,该新方法对包括两个现实问题在内的四个FFD问题均有效。该研究的主要意义和意义可以具体概括为三点。首先,它提出了一种新的GBGP集成学习技术。其次,它演示了所提出的方法生成的分类规则的可用性。第三,它为解决FFD问题提供了一种有效的多目标方法。

著录项

  • 作者

    LI Haibing;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号