Scoring the data using association rules

Liu B.; Ma YM.; Wong CK.

首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >Scoring the data using association rules

【24h】

Scoring the data using association rules

机译：使用关联规则对数据评分

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In many data mining applications, the objective is to select data cases of a target class. For example, in direct marketing, marketers want to select likely buyers of a particular product for promotion. In such applications, it is often too difficult to predict who will definitely be in the target class (e.g., the buyer class) because the data used for modeling is often very noisy and has a highly imbalanced class distribution. Traditionally, classification systems are used to solve this problem. Instead of classifying each data case to a definite class (e.g., buyer or non-buyer), a classification system is modified to produce a class probability estimate (or a score) for the data case to indicate the likelihood that the data case belongs to the target class (e.g., the buyer class). However, existing classification systems only aim to find a subset of the regularities or rules that exist in data. This subset of rules only gives a partial picture of the domain. In this paper, we show that the target selection problem can be mapped to association rule mining to provide a more powerful solution to the problem. Since association rule mining aims to find all rules in data, it is thus able to give a complete picture of the underlying relationships in the domain. The complete set of rules enables us to assign a more accurate class probability estimate to each data case. This paper proposes an effective and efficient technique to compute class probability estimates using association rules. Experiment results using public domain data and real-life application data show that in general the new technique performs markedly better than the state-of-the-art classification system C4.5, boosted C4.5, and the Naive Bayesian system. [References: 35]

机译：在许多数据挖掘应用程序中，目标是选择目标类的数据案例。例如，在直接营销中，营销人员希望选择特定产品的可能购买者进行促销。在这样的应用中，通常很难预测谁肯定会属于目标类别（例如，买方类别），因为用于建模的数据通常非常嘈杂，并且类别分布高度不平衡。传统上，分类系统用于解决此问题。代替将每个数据案例分类为确定的类别（例如，购买者或非购买者），修改分类系统以产生用于数据案例的类别概率估计（或分数），以指示该数据案例所属的可能性目标类别（例如，买方类别）。但是，现有的分类系统仅旨在查找数据中存在的规则或规则的子集。规则的这个子集仅给出了部分域的情况。在本文中，我们表明可以将目标选择问题映射到关联规则挖掘中，从而为该问题提供更强大的解决方案。由于关联规则挖掘旨在查找数据中的所有规则，因此它能够提供域中基础关系的完整图片。完整的规则集使我们能够为每个数据案例分配更准确的类别概率估计。本文提出了一种有效和高效的技术来使用关联规则来计算类别概率估计。使用公共领域数据和现实生活中的应用程序数据进行的实验结果表明，总体而言，新技术的性能明显优于最新的分类系统C4.5，增强的C4.5和朴素的贝叶斯系统。 [参考：35]

著录项

来源
《Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies 》 |2003年第2期| 共17页
作者
Liu B.; Ma YM.; Wong CK.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术 ;
关键词
Data mining; Scoring; Target selection; Association rules; Classifications;

机译：数据挖掘;评分;目标选择;关联规则;分类;

相似文献

外文文献
中文文献
专利

1. Data-driven study on the achievement of LEED credits using percentage of average score and association rule analysis [J] . Ma Jun, Cheng Jack C. P. Building and Environment . 2016 ,第Mara期

机译：基于平均分数百分比和关联规则分析的LEED学分实现的数据驱动研究
2. Applicability of Apriori Based Association Rules on Medical Data: Identification of Associations on Medical Data/Heart disease Dataset using Apriori Based Algorithm [J] . P. Sambasiva Rao, T. Uma Devi International Journal of Applied Engineering Research . 2017 ,第20aPta2期

机译：基于APRIORI基于医疗数据的适用性：使用基于APRiori的算法识别医学数据/心脏病数据集的关联
3. Mining Association Rules from No-SQL data bases using Map-Reduce Fuzzy Association Rule Mining Algorithm [J] . Chatakunta Praveen Kumar, Pole Anjaiah, Santosh Patil, International Journal of Applied Engineering Research . 2017 ,第21aPta1期

机译：使用地图减少模糊关联规则挖掘算法来自No-SQL数据基础的挖掘关联规则
4. Constructing Metrics for Evaluating Multi-Relational Association Rules in the Semantic Web from Metrics for Scoring Association Rules [C] . Tran Duc Minh, Claudia DAmato, Andrea G.B. Tettamanzi, IEEE-RIVF International Conference on Computing and Communication Technologies . 2019

机译：从计分关联规则的度量中构造用于评估语义网中多关系关联规则的度量
5. Mining fuzzy association rules on large numerical data: A data mining system for NAWN. [D] . Komo, Zimpi. 2003

机译：在大型数值数据上挖掘模糊关联规则：NAWN的数据挖掘系统。
6. Development and validation of data quality rules in administrative health data using association rule mining [O] . Mingkai Peng, Sangmin Lee, Adam G. D’Souza, 2020

机译：使用关联规则挖掘来开发和验证行政健康数据中的数据质量规则
7. Constructing Metrics for Evaluating Multi-Relational Association Rules in the Semantic Web from Metrics for Scoring Association Rules [O] . Tran Duc Minh, Claudia DAmato, Andrea G.B. Tettamanzi, 2019

机译：从评分评分关联规则中从度量评估语义Web中的多关系关联规则的标准
8. Comparison of Utah and DoDPI Scoring Accuracy: Equating Veracity Decision Rule, Chart Rule, and Number of Data Channels Used [R] . Senter, S. M. , Dollins, A. B. , Krapohl, D. J. 2000

机译：犹他州和DoDpI评分准确性的比较：等同准确性决策规则，图表规则和使用的数据通道数量

Scoring the data using association rules

摘要

著录项

相似文献

相关主题

期刊订阅