...
首页> 外文期刊>SIGKDD explorations >An Enhanced Relevance Criterion For More Concise Supervised Pattern Discovery
【24h】

An Enhanced Relevance Criterion For More Concise Supervised Pattern Discovery

机译:增强的相关性准则,用于更简洁的监督模式发现

获取原文
获取原文并翻译 | 示例

摘要

Supervised local pattern discovery aims to find subsets of a database with a high statistical unusualness in the distribution of a target attribute. Local pattern discovery is often used to generate a human-understandable representation of the most interesting dependencies in a data set. Hence, the more crisp and concise the output is, the better. Unfortunately, standard algorithm often produce very large and redundant outputs. In this paper, we introduce Δ-relevance, a definition of a more strict criterion of relevance. It will allow us to significantly reduce the output space, while being able to guarantee that every local pattern has a Δ-relevant representative which is almost as good in a clearly defined sense. We show empirically that Δ-relevance leads to a considerable reduction of the amount of returned patterns. We also demonstrate that in a top-k setting, the removal of not Δ-relevant patterns improves the quality of the result set.
机译:有监督的局部模式发现旨在查找目标属性分布中统计异常高的数据库子集。本地模式发现通常用于生成人类可理解的数据集中最有趣的依赖关系表示。因此,输出越清晰明了,效果越好。不幸的是,标准算法通常会产生非常大的冗余输出。在本文中,我们介绍了Δ相关性,这是更严格的相关性标准的定义。这将使我们能够显着减少输出空间,同时能够确保每个局部模式都有一个与Δ相关的代表,在清晰定义的意义上,该代表几乎相同。我们凭经验表明,Δ相关性导致返回模式的数量大大减少。我们还证明,在top-k设置中,去除与Δ不相关的模式可以提高结果集的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号