...
首页> 外文期刊>Journal of The Institution of Engineers (India): Series B >Weighted Hybrid Decision Tree Model for Random Forest Classifier
【24h】

Weighted Hybrid Decision Tree Model for Random Forest Classifier

机译:随机森林分类器的加权混合决策树模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Random Forest is an ensemble, supervised machine learning algorithm. An ensemble generates many classifiers and combines their results by majority voting. Random forest uses decision tree as base classifier. In decision tree induction, an attribute split/evaluation measure is used to decide the best split at each node of the decision tree. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation among them. The work presented in this paper is related to attribute split measures and is a two step process: first theoretical study of the five selected split measures is done and a comparison matrix is generated to understand pros and cons of each measure. These theoretical results are verified by performing empirical analysis. For empirical analysis, random forest is generated using each of the five selected split measures, chosen one at a time. i.e. random forest using information gain, random forest using gain ratio, etc. The next step is, based on this theoretical and empirical analysis, a new approach of hybrid decision tree model for random forest classifier is proposed. In this model, individual decision tree in Random Forest is generated using different split measures. This model is augmented by weighted voting based on the strength of individual tree. The new approach has shown notable increase in the accuracy of random forest.
机译:Random Forest是一种集成的,受监督的机器学习算法。合奏生成许多分类器,并通过多数表决将其结果合并。随机森林使用决策树作为基础分类器。在决策树归纳中,属性拆分/评估度量用于确定决策树每个节点上的最佳拆分。树分类器森林的泛化误差取决于森林中各个树的强度以及它们之间的相关性。本文介绍的工作与属性拆分度量有关,它是一个两步过程:首先对五个所选拆分度量进行了理论研究,并生成了一个比较矩阵以了解每种度量的优缺点。这些理论结果通过进行实证分析得到了验证。为了进行经验分析,将使用五个选择的分割度量中的每一个,一次选择一个,来生成随机森林。下一步是基于这一理论和经验分析,提出了一种用于随机森林分类器的混合决策树模型的新方法。在此模型中,随机森林中的各个决策树是使用不同的拆分度量生成的。该模型通过基于单个树的强度的加权投票得到增强。新方法显示出随机森林的准确性显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号