首页> 外文OA文献 >Financial fraud detection by using grammar-based multiobjective genetic programming with ensemble learning

【2h】

Financial fraud detection by using grammar-based multiobjective genetic programming with ensemble learning

机译：使用基于语法的多目标遗传规划和集成学习进行财务欺诈检测

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Financial fraud is a criminal act, which violates the law, rules or policy to gain unauthorized financial benefit. As an increasingly serious problem, it has attracted a lot of concerns. The major consequences are loss of billions of dollars each year, investor confidence and corporate reputation. Therefore, a study area called Financial Fraud Detection (FFD) is obligatory, in order to prevent the destructive results caused by financial fraud. In general, traditional modeling approaches are applied and based on pre-defined hypothesis testing of causes and effects for FFD problems. In addition, the evaluation criteria are often based on variable significance level or Goodness-of-fit only.FFD has many common features like other data mining problems. It has accumulated vast amounts of data records of different forms (e.g. financial statements or annual reports) over a period of time. It is very difficult to observe the interesting information just by relying on traditional statistical methods. However, data mining techniques can be used to extract implicit, previously unknown and potentially useful patterns, rules or relations from massive data repositories. Such discovered patterns are appropriate to executive leadership, stakeholders and related regulatory agencies to reduce or avoid the losses.As real-life problems, it is not sufficient for FFD to consider only a single criterion (e.g. Goodness-of-fit or accuracy). Instead, FFD can also seek multiple objectives (e.g. accuracy versus interestingness). It is not easy to consider multiple objectives at the same time unless applying combination methods (e.g. linear combination) by assigning different weights to present the importance for each criterion by using data mining techniques with a single evaluation criterion. For example, accuracy is more important than interestingness with weights of 0.9:0.1. But it is still difficult to decide the appropriate or exact values for weights. There-fore, multi-objective data-mining techniques are required to tackle FFD problems.In this study, FFD is targeted, and comprehensively evaluated by a number of methods. The proposed method is based on Grammar-Based Genetic Programming (GBGP), which has been proven to be a powerful data mining technique to generate compact and straightforward results. The major contributions are three improvements of GBGP for FFD problems. First, multi-criteria are considered by integrating the concept of multi-objectives into GBGP. Second, minority prediction is applied to demonstrate the class prediction with unmatched rows in their rules. Lastly, a new meta-heuristic approach is introduced for ensemble learning in order to help users to select patterns from a pool of models to facilitate final decision-making. The experimental results showed the effectiveness of the new approach in four FFD problems including two real-life problems. The major implications and significances of the study can concretely generalize for three points. First, it suggests a new ensemble learning technique with GBGP. Second, it demonstrates the usability of classification rules generated by the proposed method. Third, it provides an efficient multi-objective method for solving FFD problems.

机译：金融欺诈是一种犯罪行为，违反法律，法规或政策以获取未经授权的经济利益。作为一个日益严重的问题，它引起了很多关注。主要后果是每年损失数十亿美元，投资者信心和企业声誉。因此，为了防止财务欺诈造成的破坏性结果，必须设立一个称为财务欺诈检测（FFD）的研究领域。通常，使用传统的建模方法并基于预定义的假设检验来检验FFD问题的原因和结果。此外，评估标准通常仅基于可变的显着性水平或拟合优度。FFD具有许多共同的特征，如其他数据挖掘问题。它在一段时间内已积累了大量不同形式的数据记录（例如财务报表或年度报告）。仅依靠传统的统计方法很难观察到有趣的信息。但是，数据挖掘技术可以用于从海量数据存储库中提取隐式，以前未知且潜在有用的模式，规则或关系。这种发现的模式适合执行领导，利益相关者和相关监管机构减少或避免损失。作为现实生活中的问题，FFD仅考虑一个标准（例如拟合优度或准确性）是不够的。相反，FFD也可以寻求多个目标（例如，准确性与趣味性）。除非同时使用组合方法（例如线性组合）通过使用具有单个评估标准的数据挖掘技术分配不同的权重来表示每个标准的重要性，否则很难同时考虑多个目标。例如，权重为0.9：0.1时，准确性比趣味性更重要。但是，仍然难以确定适当或精确的权重值。因此，需要多目标数据挖掘技术来解决FFD问题。在这项研究中，FFD是有针对性的，并通过多种方法进行了综合评估。所提出的方法基于基于语法的遗传编程（GBGP），已被证明是一种强大的数据挖掘技术，可以生成紧凑而直接的结果。主要贡献是针对FFD问题的GBGP的三个改进。首先，通过将多目标概念集成到GBGP中来考虑多标准。其次，应用少数派预测来证明类别预测中规则不匹配的类。最后，为集成学习引入了一种新的元启发式方法，以帮助用户从模型库中选择模式，以促进最终决策。实验结果表明，该新方法对包括两个现实问题在内的四个FFD问题均有效。该研究的主要意义和意义可以具体概括为三点。首先，它提出了一种新的GBGP集成学习技术。其次，它演示了所提出的方法生成的分类规则的可用性。第三，它为解决FFD问题提供了一种有效的多目标方法。

著录项

作者
LI Haibing;
展开▼
作者单位

展开▼
年度 2015
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. Financial Fraud Detection: Multi-Objective Genetic Programming with Grammars and Statistical Selection Learning [J] . Haibing Li, Wing-Lun Lam, Chi-Wai Chung, International Journal of Computer Science and Engineering . 2020,第2期

机译：金融欺诈检测：具有语法和统计选择学习的多目标遗传编程
2. Multiobjective grammar-based genetic programming applied to the study of asthma and allergy epidemiology [J] . Rafael V. Veiga, Helio J. C. Barbosa, Heder S. Bernardino, BMC Bioinformatics . 2018,第1期

机译：基于多目标语法的遗传编程应用于哮喘和过敏流行病学研究
3. Probabilistic Contextual and Structural Dependencies Learning in Grammar-Based Genetic Programming [J] . Wong Pak-Kan, Wong Man-Leung, Leung Kwong-Sak Evolutionary computation . 2021,第2期

机译：基于语法的遗传编程中的概率语境和结构依赖性学习
4. Financial fraud detection by using Grammar-based multi-objective genetic programming with ensemble learning [C] . Haibing Li, Man-Leung Wong IEEE Congress on Evolutionary Computation . 2015

机译：通过使用基于语法的多目标遗传规划和集成学习进行财务欺诈检测
5. Semantics-aware Grammar-based Genetic Programming and Its Applications [D] . Wong, Pak Kan. 2019

机译：语义感知语法基于基于语法的遗传编程及其应用
6. Multiobjective grammar-based genetic programming applied to the study of asthma and allergy epidemiology [O] . Rafael V. Veiga, Helio J. C. Barbosa, Heder S. Bernardino, 2018

机译：基于多目标语法的遗传规划在哮喘和过敏流行病学研究中的应用
7. Multiobjective grammar-based genetic programming applied to the study of asthma and allergy epidemiology [O] . Rafael V. Veiga, Helio J. C. Barbosa, Heder S. Bernardino, 2018

机译：基于多目标语法的遗传编程应用于哮喘和过敏流行病学研究
8. Credit card fraud detection: An application of the gene expression messy genetic algorithm [R] . Kargupta, H. , Gattiker, J. R. , Buescher, K. 1996

机译：信用卡欺诈检测：基因表达杂乱遗传算法的应用

Financial fraud detection by using grammar-based multiobjective genetic programming with ensemble learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅