首页> 外文期刊>Computational Intelligence Magazine, IEEE >Developing Interval-Based Cost-Sensitive Classifiers by Genetic Programming for Binary High-Dimensional Unbalanced Classification [Research Frontier]
【24h】

Developing Interval-Based Cost-Sensitive Classifiers by Genetic Programming for Binary High-Dimensional Unbalanced Classification [Research Frontier]

机译:通过基于二元高维不平衡分类的遗传编程开发基于间隔的成本敏感分类器[研究前沿]

获取原文
获取原文并翻译 | 示例
           

摘要

Cost-sensitive learning is a popular approach to addressing the problem of class imbalance for many classification algorithms in machine learning. However, most cost-sensitive algorithms are dependent on manually designed cost matrices. Unfortunately, in many cases, it is often not easy for humans, even experts, to accurately specify misclassification costs for different mistakes due to the lack of domain knowledge related to actual situations in some complex unbalanced problems. As a result, these cost-sensitive algorithms cannot be directly applied. This paper proposes a new genetic programmingbased approach to developing cost-sensitive classifiers that are independent of manually designed cost matrices. The proposed method is able to construct classifiers and learn cost intervals automatically and simultaneously. In the proposed method, a tree representation, terminal sets and function sets are designed and developed. We examine the effectiveness of the proposed method on ten high-dimensional unbalanced datasets. The experimental results show that the proposed method often outperforms compared methods for highdimensional unbalanced classification. Furthermore, according to the analysis of evolved trees, the constructed classifiers often only need a small number of features to achieve a good classification performance.
机译:成本敏感的学习是一种流行的方法来解决机器学习中许多分类算法的类别不平衡问题。然而,大多数成本敏感的算法依赖于手动设计的成本矩阵。不幸的是,在许多情况下,人类甚至专家往往不容易,以准确规定由于在一些复杂的不平衡问题中缺乏与实际情况相关的领域知识缺乏域名知识而进行错误分类成本。结果,不能直接应用这些成本敏感的算法。本文提出了一种新的遗传编程基础,用于开发与手动设计成本矩阵无关的成本敏感分类器。所提出的方法能够构建分类器并自动和同时学习成本间隔。在所提出的方法中,设计并开发了树形表示,终端组和功能集。我们在十个高维不平衡数据集中检查所提出的方法的有效性。实验结果表明,该方法往往优于比较高度不平衡分类的比较方法。此外,根据进化树的分析,构造的分类器通常只需要少量的特征来实现良好的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号