Efficient Rule Retrieval and Postponed Restrict Operations for Association Rule Mining

机译：关联规则挖掘的有效规则检索和延迟的限制操作

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Knowledge discovery in databases is a complex, iterative, and highly interactive process. When mining for association rules, typically interactivity is largely smothered by the execution times of the rule generation algorithms. Our approach is to accept a single, possibly expensive run, but all subsequent mining queries are supposed to be answered interactively by accessing a sophisticated rule cache. However there are two critical aspects. First, access to the cache must be efficient and comfortable. Therefore we enrich the basic association mining framework by descriptions of items through application dependent attributes. Furthermore we extend current mining query languages to deal with these attributes through "exist" and "any" quantifiers. Second, the cache must be prepared to answer a broad variety of queries without rerunning the mining algorithm. A main contribution of this paper is that we show how to postpone restrict operations on the transactions from rule generation to rule retrieval from the cache. That is, without actually rerunning the algorithm, we efficiently construct those rules from the cache that would have been generated if the mining algorithm were run on only a subset of the transactions. In addition we describe how we implemented our ideas on a conventional relational database system. We evaluate our prototype concerning response times in a pilot application at DaimlerChrysler. It turns out to satisfy easily the demands of interactive data mining.

机译：数据库中的知识发现是一个复杂，反复且高度交互的过程。在挖掘关联规则时，通常，规则生成算法的执行时间会极大地抑制交互性。我们的方法是接受一个可能很昂贵的运行，但是应该通过访问复杂的规则缓存以交互方式回答所有后续挖掘查询。但是，有两个关键方面。首先，对缓存的访问必须高效且舒适。因此，我们通过依赖于应用程序的属性对项目进行描述，从而丰富了基本的关联挖掘框架。此外，我们扩展了当前的挖掘查询语言，以通过“存在”和“任何”量词来处理这些属性。其次，缓存必须准备好回答各种各样的查询，而无需重新运行挖掘算法。本文的主要贡献在于，我们展示了如何推迟对事务的限制操作，从规则生成到从缓存中检索规则。也就是说，如果没有真正重新运行该算法，那么我们可以从缓存中高效地构造那些规则，如果挖掘算法仅在事务的一个子集上运行，则这些规则将已经生成。另外，我们描述了如何在常规的关系数据库系统上实现我们的想法。我们在戴姆勒克莱斯勒的试点应用中评估了有关响应时间的原型。事实证明，可以轻松满足交互式数据挖掘的需求。

著录项

来源
《Advances in Knowledge Discovery and Data Mining》|2002年|p.52-65|共14页
会议地点
作者
Jochen Hipp; Christoph Mangold; Ulrich Guentzer; Gholamreza Nakhaeizadeh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. An efficient retrieval using edge GLCM and association rule mining guided IPSO based artificial neural network [J] . Boomilingam Thenkalvi, Subramaniam Murugavalli Multimedia Tools and Applications . 2017,第20期

机译：基于边缘GLCM和关联规则挖掘的基于IPSO的人工神经网络的有效检索
2. Mining Association Rules from No-SQL data bases using Map-Reduce Fuzzy Association Rule Mining Algorithm [J] . Chatakunta Praveen Kumar, Pole Anjaiah, Santosh Patil, International Journal of Applied Engineering Research . 2017,第21aPta1期

机译：使用地图减少模糊关联规则挖掘算法来自No-SQL数据基础的挖掘关联规则
3. Content-based image retrieval using association rule mining with soft relevance feedback [J] . Peng-Yeng Yin, Shin-Huei Li Journal of visual communication & image representation . 2006,第5期

机译：使用具有软相关性反馈的关联规则挖掘的基于内容的图像检索
4. Efficient Rule Retrieval and Postponed Restrict Operations for Association Rule Mining [C] . Jochen Hipp, Christoph Mangold, Ulrich Guentzer, Pacific-Asia Conference on Knowledge Discovery and Data Mining . 2002

机译：高效的规则检索和关联规则挖掘的限制操作
5. Association rule mining and quantitative association rule mining among infrequent items. [D] . Zhou, Ling. 2007

机译：罕见项目之间的关联规则挖掘和定量关联规则挖掘。
6. TSARM-UDP: An Efficient Time Series Association Rules Mining Algorithm Based on Up-to-Date Patterns [O] . Qiang Zhao, Qing Li, Deshui Yu, 2021

机译：TSARM-UDP：基于最新模式的有效时间序列关联规则挖掘算法
7. Efficient Rule Retrieval and Postponed Restrict Operations for Association Rule Mining [O] . Jochen Hipp, Christoph Mangold, Ulrich Güntzer, 2002

机译：关联规则挖掘的有效规则检索和延迟约束操作

Efficient Rule Retrieval and Postponed Restrict Operations for Association Rule Mining

摘要

著录项

相似文献

相关主题

期刊订阅