首页> 外文会议>International Workshop on Pattern Recognition in Information Systems >Behavior of Consolidated Trees when using Resampling Techniques
【24h】

Behavior of Consolidated Trees when using Resampling Techniques

机译:使用重采样技术时综合树木的行为

获取原文
获取外文期刊封面目录资料

摘要

Many machine learning areas use subsampling techniques with different objectives: reducing the size of the training set, equilibrate the class imbalance of non-uniform cost error, etc. Subsampling affects severely to the behavior of classification algorithms. Decision trees induced form different subsamples of the same data set are very different in accuracy and structure. The final classifier is a single decision tree, so that it maintains the explaining capacity of the clssification. A comparison in error and structural stability of our algorithm and the C4.5 algorithm is one. The decision trees generated using the new algorithm, achieve smaller error rates and structurally more steady trees than C4.5 when using subsampling techniques.
机译:许多机器学习区域使用具有不同目标的超级样品技术:减少培训集的大小,平衡非均匀成本误差的类别不平衡等。分类对分类算法的行为严重影响。决策树诱导形状的相同数据集的不同子样本的准确性和结构非常不同。最终分类器是单个决策树,以便它保持CLSIFICE的解释能力。我们算法的误差和结构稳定性和C4.5算法的比较是一个。使用新算法生成的决策树,在使用子采样技术时,实现较小的误差率和结构上比C4.5更稳定的树木。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号