首页> 外文期刊>World Wide Web >An effective contrast sequential pattern mining approach to taxpayer behavior analysis
【24h】

An effective contrast sequential pattern mining approach to taxpayer behavior analysis

机译:一种有效的对比顺序模式挖掘方法进行纳税人行为分析

获取原文
获取原文并翻译 | 示例
       

摘要

Data mining for client behavior analysis has become increasingly important in business, however further analysis on transactions and sequential behaviors would be of even greater value, especially in the financial service industry, such as banking and insurance, government and so on. In a real-world business application of taxation debt collection, in order to understand the internal relationship between taxpayers' sequential behaviors (payment, lodgment and actions) and compliance to their debt, we need to find the contrast sequential behavior patterns between compliant and non-compliant taxpayers. Contrast Patterns (CP) are defined as the itemsets showing the difference/discrimination between two classes/datasets (Dong and Li, 1999). However, the existing CP mining methods which can only mine itemset patterns, are not suitable for mining sequential patterns, such as time-ordered transactions in taxpayer sequential behaviors. Little work has been conducted on Contrast Sequential Pattern (CSP) mining so far. Therefore, to address this issue, we develop a CSP mining approach, e C S P, by using an effective CSP-tree structure, which improves the PrefixSpan tree (Pei et al., 2001) for mining contrast patterns. We propose some heuristics and interestingness filtering criteria, and integrate them into the CSP-tree seamlessly to reduce the search space and to find business-interesting patterns as well. The performance of the proposed approach is evaluated on three real-world datasets. In addition, we use a case study to show how to implement the approach to analyse taxpayer behaviour. The results show a very promising performance and convincing business value.
机译:用于客户行为分析的数据挖掘在业务中已变得越来越重要,但是对交易和顺序行为的进一步分析将具有更大的价值,尤其是在金融服务行业,例如银行和保险,政府等。在现实世界中的税收收债业务应用中,为了了解纳税人的先后行为(付款,住宿和行为)与他们的债务合规之间的内部关系,我们需要找到合规与不合规之间的对比先后行为模式合规的纳税人。对比模式(CP)定义为表示两个类别/数据集之间差异/区分的项目集(Dong和Li,1999)。但是,现有的只能挖掘项目集模式的CP挖掘方法不适用于挖掘顺序模式,例如纳税人顺序行为中的时间顺序交易。到目前为止,有关对比度顺序模式(CSP)挖掘的工作很少。因此,为了解决这个问题,我们通过使用有效的CSP树结构开发了一种CSP挖掘方法e C S P,它改进了PrefixSpan树(Pei等人,2001),用于挖掘对比模式。我们提出了一些启发式和兴趣过滤标准,并将它们无缝集成到CSP树中,以减少搜索空间并找到感兴趣的业务模式。在三个真实的数据集上评估了该方法的性能。此外,我们通过案例研究来说明如何实施分析纳税人行为的方法。结果显示出非常令人鼓舞的性能和令人信服的商业价值。

著录项

  • 来源
    《World Wide Web》 |2016年第4期|633-651|共19页
  • 作者单位

    Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, Sydney, NSW 2007, Australia;

    Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, Sydney, NSW 2007, Australia;

    Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, Sydney, NSW 2007, Australia;

    Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, Sydney, NSW 2007, Australia;

    Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, Sydney, NSW 2007, Australia;

    Australian Taxat Off, Sydney, NSW, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Contrast pattern; Sequential pattern; Client behavior analysis;

    机译:对比模式;顺序模式;客户行为分析;
  • 入库时间 2022-08-17 13:25:57

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号