首页> 外文会议>Conference on multisensor, multisource information fusion: Architectures, algorithms, and applications >Using classifier fusion to improve the performance of multiclass classification problems
【24h】

Using classifier fusion to improve the performance of multiclass classification problems

机译:使用分类器融合来提高多分类问题的性能

获取原文
获取外文期刊封面目录资料

摘要

The problem of multiclass classification is often modeled by breaking it down into a collection of binary classifiers, as opposed to jointly modeling all classes with a single primary classifier. Various methods can be found in the literature for decomposing the multiclass problem into a collection of binary classifiers. Typical algorithms that are studied here include each versus all remaining (EVAR), each versus all individually (EVAI), and output correction coding (OCC). With each of these methods a classifier fusion based decision rule is formulated utilizing the various binary classifiers to determine the correct classification of an unknown data point. For example, with EVAR the binary classifier with maximum output is chosen. For EVAI, the correct class is chosen using a majority voting rule, and with OCC a comparison algorithm based minimum Hamming distance metric is used. In this paper, it is demonstrated how these various methods perform utilizing the Bayesian Reduction Algorithm (BDRA) as the primary classifier. BDRA is a discrete data classification method that quantizes and reduces the dimensionality of feature data for best classification performance. In this case, BDRA is used to not only train the appropriate binary classifier pairs, but it is also used to train on the discrete classifier outputs to formulate the correct classification decision of unknown data points. In this way, it is demonstrated how to predict which binary classification based algorithm method (i.e., EVAR, EVAI, or OCC) performs best with BDRA. Experimental results are shown with real data sets taken from the Knowledge Extraction based on Evolutionary Learning (KEEL) and University of California at Irvine (UCI) Repositories of classifier Databases. In general, and for the data sets considered, it is shown that the best classification method, based on performance with unlabeled test observations, can be predicted form performance on labeled training data. Specifically, the best method is shown to have the least overall probability of error, and the binary classifiers have the least overall average quantization complexity.
机译:多类分类的问题通常是通过将其分解为二进制分类器的集合来建模的,这与使用单个主分类器联合对所有类进行建模相反。在文献中可以找到用于将多类问题分解为二进制分类器集合的各种方法。在这里研究的典型算法包括每个对所有剩余的(EVAR),每个对所有单独的(EVAI)以及输出校正编码(OCC)。通过这些方法中的每一种,利用各种二进制分类器来制定基于分类器融合的决策规则,以确定未知数据点的正确分类。例如,对于EVAR,选择具有最大输出的二进制分类器。对于EVAI,使用多数投票规则选择正确的类别,对于OCC,将使用基于比较算法的最小汉明距离度量。在本文中,演示了如何利用贝叶斯归约算法(BDRA)作为主要分类器来执行这些各种方法。 BDRA是一种离散的数据分类方法,它可以量化并减小特征数据的维数以实现最佳分类性能。在这种情况下,BDRA不仅用于训练适当的二进制分类器对,而且还用于训练离散分类器输出以制定未知数据点的正确分类决策。以这种方式,证明了如何预测哪种基于二进制分类的算法方法(即EVAR,EVAI或OCC)在BDRA中效果最佳。使用基于进化学习的知识提取(KEEL)和加利福尼亚大学欧文分校(UCI)分类器数据库存储库中的真实数据集显示了实验结果。通常,对于所考虑的数据集,可以证明,基于未标记测试观察结果的最佳分类方法,可以根据标记训练数据的表现来预测。具体而言,最佳方法显示出具有最小的总体错误概率,而二元分类器的总体平均量化复杂度也最小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号