首页> 外文会议>International Conference on Advanced Data Mining and Applications >Topic Discovery and Topic-Driven Clustering for Audit Method Datasets
【24h】

Topic Discovery and Topic-Driven Clustering for Audit Method Datasets

机译:主题发现和主题驱动群集用于审计方法数据集

获取原文

摘要

As the promotion of China's Golden Auditing Project and the fast growth of on-line auditing, there are thousands of new computer audit methods emerged every year to fulfill various needs of audit prac-tices. How to organize these existing computer audit methods and use them intelligently have become a fundamental and challenging problem. In this paper, we propose to use topic-driven clustering methods to organize computer audit methods according to the system of computer audit methods that is issued by the National Audit Office of China. We also apply Latent Dirichlet allocation (LDA) analysis to audit method datasets at different levels of granularity. Our experimental results on social insurance computer audit methods show that the topic-driven clustering scheme with topics created by domain experts is the overall best scheme. It achieved an average purity of 0.862 across the datasets. Topics discovered by LDA were consistent with classes defined in the taxonomy for four out of five datasets, and they were effective when used in the topic-driven clustering scheme.
机译:随着中国黄金审计项目的推广和在线审计的快速增长,每年都有成千上万的新计算机审计方法,以满足审计实践的各种需求。如何组织这些现有的计算机审计方法并使用它们智能地成为一个基本和挑战性问题。在本文中,我们建议使用主题驱动的聚类方法来根据国家审计办公室颁发的计算机审计方法系统组织计算机审计方法。我们还将潜在的Dirichlet分配(LDA)分析应用于不同粒度的审计方法数据集。我们对社会保险计算机审计方法的实验结果表明,主题驱动的聚类方案与域专家创建的主题是整体最佳方案。它在数据集中实现了0.862的平均纯度。 LDA发现的主题与分类学中定义的四个数据集中定义的类一致,并且在主题驱动的聚类方案中使用时它们是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号