首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Active Learning From Imbalanced Data: A Solution of Online Weighted Extreme Learning Machine
【24h】

Active Learning From Imbalanced Data: A Solution of Online Weighted Extreme Learning Machine

机译:从不平衡数据主动学习:在线加权极限学习机的解决方案

获取原文
获取原文并翻译 | 示例

摘要

It is well known that active learning can simultaneously improve the quality of the classification model and decrease the complexity of training instances. However, several previous studies have indicated that the performance of active learning is easily disrupted by an imbalanced data distribution. Some existing imbalanced active learning approaches also suffer from either low performance or high time consumption. To address these problems, this paper describes an efficient solution based on the extreme learning machine (ELM) classification model, called active online-weighted ELM (AOW-ELM). The main contributions of this paper include: 1) the reasons why active learning can be disrupted by an imbalanced instance distribution and its influencing factors are discussed in detail; 2) the hierarchical clustering technique is adopted to select initially labeled instances in order to avoid the missed cluster effect and cold start phenomenon as much as possible; 3) the weighted ELM (WELM) is selected as the base classifier to guarantee the impartiality of instance selection in the procedure of active learning, and an efficient online updated mode of WELM is deduced in theory; and 4) an early stopping criterion that is similar to but more flexible than the margin exhaustion criterion is presented. The experimental results on 32 binary-class data sets with different imbalance ratios demonstrate that the proposed AOW-ELM algorithm is more effective and efficient than several state-of-the-art active learning algorithms that are specifically designed for the class imbalance scenario.
机译:众所周知,主动学习可以同时提高分类模型的质量并降低训练实例的复杂性。但是,先前的一些研究表明,主动学习的表现很容易因数据分布不平衡而中断。一些现有的不平衡的主动学习方法也遭受性能低下或时间消耗高的困扰。为了解决这些问题,本文介绍了一种基于极限学习机(ELM)分类模型的有效解决方案,该模型称为主动在线加权ELM(AOW-ELM)。本文的主要贡献包括:1)详细讨论了实例分布不均衡会干扰主动学习的原因及其影响因素; 2。 2)采用层次聚类技术选择初始标记的实例,以尽可能避免遗漏聚类效应和冷启动现象; 3)选择加权ELM(WELM)作为基本分类器,以保证主动学习过程中实例选择的公正性,并从理论上推导了一种有效的WELM在线更新模式; 4)提出了一种提前停止准则,该准则与保证金耗尽准则相似但更灵活。在32个具有不同失衡比的二元类数据集上的实验结果表明,与为类不平衡场景专门设计的几种最新的主动学习算法相比,所提出的AOW-ELM算法更加有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号