【24h】

A Metric for Selection of the Most Promising Rules

机译:选择最有前途的规则的指标

获取原文

摘要

The process of Knowledge Discovery in Databases pursues the goal of extracting useful knowledge from large amounts of data. It comprises a pre-processing step, application of a data-mining algorithm and post-processing of results. When rule induction is applied for datamining one must be prepared to deal with the generation of a large number of rules. In these circumstances it is important to have a way of selecting the rules that have the highest predictive power. We propose a metric for selection of the n rules with the highest average distance between them. We defend that applying our metric to select the rules that are more distant improves the system prediction capabilities against other criteria for rule selection. We present an application example and empirical results produced from a synthesized data set on a financial domain.
机译:数据库中的知识发现过程追求从大量数据提取有用知识的目标。它包括预处理步骤,应用数据挖掘算法和结果后处理。当规则诱导应用于Datamining时,必须准备好处理大量规则的产生。在这种情况下,有一种选择具有最高预测力的规则是重要的。我们提出了一个测量值,以选择N规则,其中与它们之间的最高平均距离。我们捍卫将我们的指标应用于更远的规则,以改善系统预测能力,以解决规则选择的其他标准。我们提出了一种从金融领域的合成数据产生的应用示例和经验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号