首页> 外文期刊>Knowledge-Based Systems >New margin-based subsampling iterative technique in modified random forests for classification
【24h】

New margin-based subsampling iterative technique in modified random forests for classification

机译:改进的随机森林中基于余量的新子采样迭代技术用于分类

获取原文
获取原文并翻译 | 示例
           

摘要

Diversity within base classifiers has been recognized as an important characteristic of an ensemble classifier. Data and feature sampling are two popular methods of increasing such diversity. This is exemplified by Random Forests (RFs), known as a very effective classifier. However real-world data remain challenging due to several issues, such as multi-class imbalance, data redundancy, and class noise. Ensemble margin theory is a proven effective way to improve the performance of classification models. It can be used to detect the most important instances and thus help ensemble classifiers to avoid the negative effects of the class noise and class imbalance. To obtain accurate classification results, this paper proposes the Ensemble-Margin Based Random Forests (EMRFs) method, which combines RFs and a new subsampling iterative technique making use of computed ensemble margin values. As for comparative analysis, the learning techniques considered are: SVM, AdaBoost, RFs and the Subsample based Random Forests (SubRFs). The SubRFs uses Out-Of-Bag (OOB) estimation to optimize the training size. The effectiveness of EMRFs is demonstrated on both balanced and imbalanced datasets. (C) 2019 Elsevier B.V. All rights reserved.
机译:基本分类器内的多样性已被认为是整体分类器的重要特征。数据和特征采样是增加这种多样性的两种流行方法。这被称为非常有效的分类器的随机森林(RF)举例说明。但是,由于多个问题,例如多类不平衡,数据冗余和类噪声,现实世界中的数据仍然具有挑战性。集成余量理论是提高分类模型性能的有效方法。它可用于检测最重要的实例,从而帮助集成分类器避免类噪声和类不平衡的负面影响。为了获得准确的分类结果,本文提出了一种基于集成余量的随机森林(EMRF)方法,该方法结合了RF和利用计算出的整体余量值的新的二次采样迭代技术。至于比较分析,考虑的学习技术是:SVM,AdaBoost,RF和基于子样本的随机森林(SubRF)。 SubRF使用袋外(OOB)估计来优化训练规模。 EMRF的有效性在平衡和不平衡数据集上都得到了证明。 (C)2019 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Knowledge-Based Systems》 |2019年第15期|104845.1-104845.12|共12页
  • 作者单位

    Xidian Univ Sch Elect Engn Xian 710071 Shaanxi Peoples R China|Chinese Acad Sci Inst Remote Sensing & Digital Earth Key Lab Digital Earth Sci Beijing 100094 Peoples R China;

    Univ Paris XIII Inst Galilee Lab Informat Proc & Transmiss L2TI Villetaneuse France;

    Chinese Acad Sci Inst Remote Sensing & Digital Earth Key Lab Digital Earth Sci Beijing 100094 Peoples R China;

    Xidian Univ Key Lab Radar Signal Proc Xian 710071 Shaanxi Peoples R China;

    Univ Ghent Dept Telecommun & Informat Proc IMEC TELIN Sint Pietersnieuwstr 41 B-9000 Ghent Belgium;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Classification; Ensemble margin; Diversity; Random forests; Sub-sampling;

    机译:分类;合奏边缘多样性;随机森林;二次采样;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号