【24h】

Finding Maximal Non-redundant Association Rules in Tennis Data

机译:在网球数据中查找最大非冗余关联规则

获取原文

摘要

The concept of association rules is well-known in data mining. But often redundancy and subsumption are not considered, and standard approaches produce thousands or even millions of resulting association rules. Without further information or post-mining approaches, this huge number of rules is typically useless for the domain specialist -which is an instance of the infamous pattern explosion problem. In this work, we present a new definition of redundancy and subsumption based on the confidence and the support of the rules and propose post-mining to prune a set of association rules. In a case study, we apply our method to association rules mined from spatio-temporal data. The data represent the trajectories of the ball in tennis matches - more precisely, the points/times the tennis ball hits the ground. The goal is to analyze the strategies of the players and to try to improve their performance by looking at the resulting association rules. Here, the domain specialist was able to select useful rules during post-mining. The proposed approach is general and could also be applied to other spatio-temporal data with a similar structure.
机译:关联规则的概念在数据挖掘中是众所周知的。但是通常不考虑冗余和包容性,标准方法会产生成千上万的关联规则。如果没有进一步的信息或采后的方法,那么大量的规则通常对领域专家来说是无用的,这是臭名昭著的模式爆炸问题的一个实例。在这项工作中,我们基于规则的置信度和支持提出了冗余和包含的新定义,并提出了挖掘后的方法以修剪一组关联规则。在一个案例研究中,我们将我们的方法应用于从时空数据中提取的关联规则。数据代表网球比赛中球的轨迹-更准确地说,是网球击中地面的点数/时间。目的是分析玩家的策略,并通过查看由此产生的关联规则来尝试改善他们的表现。在这里,领域专家能够在挖掘后的过程中选择有用的规则。所提出的方法是通用的,并且也可以应用于具有类似结构的其他时空数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号