...
首页> 外文期刊>Information Sciences: An International Journal >Multiobjective optimization of classifiers by means of 3D convex-hull-based evolutionary algorithms
【24h】

Multiobjective optimization of classifiers by means of 3D convex-hull-based evolutionary algorithms

机译:借助基于3D凸包的进化算法对分类器进行多目标优化

获取原文
获取原文并翻译 | 示例

摘要

The receiver operating characteristic (ROC) and detection error tradeoff (DET) curves are frequently used in the machine learning community to analyze the performance of binary classifiers. Recently, the convex-hull-based multiobjective genetic programming algorithm was proposed and successfully applied to maximize the convex hull area for binary classification problems by minimizing false positive rate and maximizing true positive rate at the same time using indicator-based evolutionary algorithms. The area under the ROC curve was used for the performance assessment and to guide the search. Here we extend this research and propose two major advancements: Firstly we formulate the algorithm in detection error tradeoff space, minimizing false positives and false negatives, with the advantage that misclassification cost tradeoff can be assessed directly. Secondly, we add complexity as an objective function, which gives rise to a 3D objective space (as opposed to a 2D previous ROC space). A domain specific performance indicator for 3D Pareto front approximations, the volume above DET surface, is introduced, and used to guide the indicator -based evolutionary algorithm to find optimal approximation sets. We assess the performance of the new algorithm on designed theoretical problems with different geometries of Pareto fronts and DET surfaces, and two application-oriented benchmarks: (1) Designing spam filters with low numbers of false rejects, false accepts, and low computational cost using rule ensembles, and (2) finding sparse neural networks for binary classification of test data from the UCI machine learning benchmark. The results show a high performance of the new algorithm as compared to conventional methods for multicriteria optimization. (C) 2016 Elsevier Inc. All rights reserved.
机译:机器学习社区中经常使用接收器工作特性(ROC)和检测误差折衷(DET)曲线来分析二进制分类器的性能。近年来,提出了一种基于凸包的多目标遗传规划算法,并通过基于指标的进化算法,通过最小化误报率和最大化真实率,成功地将凸包面积最大化用于二元分类问题。 ROC曲线下的区域用于性能评估并指导搜索。在这里,我们扩展了这项研究并提出了两个主要进展:首先,我们在检测误差权衡空间中制定了算法,最大程度地减少了误报和误报,其优点是可以直接评估误分类成本。其次,我们将复杂度添加为目标函数,从而产生了3D目标空间(与之前的2D ROC空间相对)。引入了用于3D Pareto前沿逼近的特定于领域的性能指标(DET曲面上方的体积),该指标用于指导基于指标的演化算法找到最佳逼近集。我们在具有不同几何形状的Pareto前沿和DET曲面的设计理论问题以及两个面向应用的基准测试中评估了该新算法的性能:(1)设计垃圾邮件过滤器,该过滤器具有较少的错误拒绝,错误接受和低计算成本规则集合,以及(2)从UCI机器学习基准中找到稀疏神经网络以对测试数据进行二进制分类。结果表明,与传统的多准则优化方法相比,该新算法具有较高的性能。 (C)2016 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号