首页> 外文会议>Pacific Rim International Conference on Artificial Intelligence >A Regression-Based Approach for Improving the Association Rule Mining through Predicting the Number of Rules on General Datasets
【24h】

A Regression-Based Approach for Improving the Association Rule Mining through Predicting the Number of Rules on General Datasets

机译:一种基于回归的方法,用于通过预测一般数据集上的规则数量来改进关联规则挖掘

获取原文

摘要

Association rule mining is one of the useful techniques in data mining and knowledge discovery that extracts interesting relationships between items in datasets. Generally, the number of association rules in a particular dataset mainly depends on the measures of 'support' and 'confidence'. To choose the number of useful rules, normally, the measures of 'support' and 'confidence' need to be tried many times. In some cases, the measures of 'support' and 'confidence' are chosen by experience. Thus, it is a time consuming to find the optimal measure of 'support' and 'confidence' for the process of association rule mining in large datasets. This paper proposes a regression based approach to improve the association rule mining process through predicting the number of rules on datasets. The approach includes a regression model in a generic level for general domains and an instantiation scheme to create concrete models in particular domains for predicting the potential number of association rules on a dataset before mining. The proposed approach can be used in broad domains with different types of datasets to improve the association rule mining process. A case study to build a concrete regression model based on a real dataset is demostrated and the result shows the good performance of the proposed approach.
机译:关联规则挖掘是数据挖掘和知识发现中的有用技术之一,从而提取数据集中项目之间有趣的关系。通常,特定数据集中的关联规则的数量主要取决于“支持”和“信心”的措施。为了选择有用规则的数量,通常情况下,“支持”和“信心”的措施需要多次尝试。在某些情况下,通过经验选择“支持”和“信心”的措施。因此,对于在大型数据集中的关联规则挖掘过程中找到“支持”和“信心”的最佳度量是令人徒次的令人徒次耗时。本文提出了一种基于回归的方法来改进关联规则挖掘过程,通过预测数据集上的规则数。该方法包括在通用域的通用级别和实例化方案中的回归模型,以在挖掘之前以用于预测数据集上的潜在域的具体模型来创建具体模型。所提出的方法可以在具有不同类型数据集的广泛域中使用,以改善关联规则挖掘过程。基于实际数据集构建一个具体回归模型的案例研究被脱开,结果显示了所提出的方法的良好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号