首页> 外文期刊>IEEE Expert >Capturing knowledge through top-down induction of decision trees
【24h】

Capturing knowledge through top-down induction of decision trees

机译:通过自上而下的决策树归纳来获取知识

获取原文
获取原文并翻译 | 示例
           

摘要

TDIDT (top-down induction of decision trees) methods for heuristic rule generation lead to unnecessarily complex representations of induced knowledge and are overly sensitive to noise in training data. Practical alternatives to TDIDT approaches which lead to more direct representations of the same knowledge, are examined. The alternatives are more immune to problems with spurious correlations in small data sets and to noise in initial training data. These knowledge representation problems and alternatives are examined in the context of chess, for which a TDIDT algorithm called the ID3 algorithm was originally devised. Modifications to the ID3 algorithm are proposed so that users can measure heuristically the information content of attributes to guide search. The program iteratively examines all positive instances remaining to be covered, along with negative training-set instances; search does not take place with irrelevant context restrictions. This algorithm is no more complex than TDIDT, just as fast and less sensitive to noise and it leads to clearer representations of the information present in training-set data.
机译:用于启发式规则生成的TDIDT(决策树的自上而下的归纳)方法导致不必要地复杂化了归纳知识的表示,并且对训练数据中的噪声过于敏感。研究了TDIDT方法的替代方法,这些方法可导致对相同知识的更直接表示。这些替代方案更不受小数据集中的虚假相关性问题和初始训练数据中的噪声的影响。在国际象棋的语境中研究了这些知识表示问题和替代方法,为此最初设计了称为ID3算法的TDIDT算法。提出了对ID3算法的修改,以便用户可以启发式地测量属性的信息内容,以指导搜索。该程序反复检查所有尚待覆盖的积极实例以及消极训练集实例;搜索不会在不相关的上下文限制下进行。该算法与TDIDT一样复杂,并且速度更快,对噪声更不敏感,并且可以更清晰地表示训练集数据中的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号