...
首页> 外文期刊>Knowledge and Information Systems >Constraining and summarizing association rules in medical data
【24h】

Constraining and summarizing association rules in medical data

机译:约束和汇总医学数据中的关联规则

获取原文
获取原文并翻译 | 示例

摘要

Association rules are a data mining technique used to discover frequent patterns in a data set. In this work, association rules are used in the medical domain, where data sets are generally high dimensional and small. The chief disadvantage about mining association rules in a high dimensional data set is the huge number of patterns that are discovered, most of which are irrelevant or redundant. Several constraints are proposed for filtering purposes, since our aim is to discover only significant association rules and accelerate the search process. A greedy algorithm is introduced to compute rule covers in order to summarize rules having the same consequent. The significance of association rules is evaluated using three metrics: support, confidence and lift. Experiments focus on discovering association rules on a real data set to predict absence or existence of heart disease. Constraints are shown to significantly reduce the number of discovered rules and improve running time. Rule covers summarize a large number of rules by producing a succinct set of rules with high-quality metrics.
机译:关联规则是一种数据挖掘技术,用于发现数据集中的频繁模式。在这项工作中,关联规则用于医学领域,在该领域中,数据集通常是高维的,并且数据量较小。在高维数据集中挖掘关联规则的主要缺点是发现了大量的模式,其中大多数是不相关或多余的。出于过滤目的,提出了一些约束条件,因为我们的目的是仅发现重要的关联规则并加快搜索过程。为了总结具有相同结果的规则,引入了贪婪算法来计算规则覆盖率。关联规则的重要性使用三个指标进行评估:支持,信心和提升。实验着重于发现真实数据集上的关联规则以预测心脏病的缺失或存在。约束条件可以显着减少发现的规则数量并缩短运行时间。规则涵盖通过生成一组具有高质量度量的简洁规则来总结大量规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号