Sampling-based sequential subgroup mining

机译：基于采样的顺序子组挖掘

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Subgroup discovery is a learning task that aims at finding interesting rules from classified examples. The search is guided by a utility function, trading off the coverage of rules against their statistical unusualness. One shortcoming of existing approaches is that they do not incorporate prior knowledge. To this end a novel generic sampling strategy is proposed. It allows to turn pattern mining into an iterative process. In each iteration the focus of subgroup discovery lies on those patterns that are unexpected with respect to prior knowledge and previously discovered patterns. The result of this technique is a small diverse set of understandable rules that characterise a specified property of interest. As another contribution this article derives a simple connection between subgroup discovery and classifier induction. For a popular utility function this connection allows to apply any standard rule induction algorithm to the task of subgroup discovery after a step of stratified resampling. Theproposed techniques are empirically compared to state of the art subgroup discovery algorithms.

机译：小组发现是一项学习任务，旨在从分类示例中找到有趣的规则。搜索以效用函数为指导，以权衡规则的覆盖范围和统计上的异常性为代价。现有方法的一个缺点是它们没有合并先验知识。为此，提出了一种新颖的通用采样策略。它允许将模式挖掘转换为迭代过程。在每次迭代中，子组发现的重点都在于那些相对于先验知识和先前发现的模式而言出乎意料的模式。这项技术的结果是形成了一组可理解的小规则，这些规则描述了指定的感兴趣属性。作为另一贡献，本文得出了亚组发现与分类器归纳之间的简单联系。对于流行的实用程序功能，此连接允许在分层重采样步骤之后将任何标准规则归纳算法应用于子组发现任务。将所提议的技术与现有技术的子组发现算法进行经验比较。

著录项

来源
《ACM SIGKDD international conference on Knowledge discovery in data mining》|2005年|P.265-274|共10页
会议地点
作者
Martin Scholz; PMartin Scholz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
subgroup discovery;

机译：小组发现;

相似文献

外文文献
中文文献
专利

1. Mining top-k sequential patterns in transaction database graphs: A new challenging problem and a sampling-based approach [J] . Lei Mingtao, Chu Lingyang, Wang Zhefeng, World Wide Web . 2020,第1期

机译：在交易数据库图中挖掘top-k顺序模式：一个新的挑战性问题和一种基于采样的方法
2. A COMBINED DETERMINISTIC AND SAMPLING-BASED SEQUENTIAL BOUNDING METHOD FOR STOCHASTIC PROGRAMMING [J] . Peguy Pierre-Louis, Guzin Bayraksan, David P. Morton Proceedings of the Workshop on Principles of Advanced and Distributed Simulation . 2011,第CDaROM期

机译：随机规划的确定性和采样相结合的顺序绑定方法
3. Locality sensitive hashing for sampling-based algorithms in association rule mining [J] . Chyouhwa Chen, Shi-Jinn Horng, Chin-Pin Huang Expert Systems with Application . 2011,第10期

机译：关联规则挖掘中基于采样的算法的局部敏感哈希
4. Sampling-Based Sequential Subgroup Mining [C] . Martin Scholz Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD'05); 20050821-24; Chicago,IL(US) . 2005

机译：基于采样的顺序子群挖掘
5. Event-oriented Analysis and Mitigating the Impact of Redundancy in Sequential Pattern Mining [D] . Singh, Rina. 2018

机译：面向事件的分析和减轻冗余对顺序模式挖掘的影响
6. Differentially Private Frequent Sequence Mining via Sampling-based Candidate Pruning [O] . Shengzhi Xu, Sen Su, Xiang Cheng, -1

机译：通过基于采样的候选修剪进行差分私有频繁序列挖掘
7. Sampling-Based Sequential Subgroup Mining [O] . Martin Scholz 2005

机译：基于抽样的顺序子组挖掘

Sampling-based sequential subgroup mining

摘要

著录项

相似文献

相关主题

期刊订阅