...
首页> 外文期刊>Mathematical Problems in Engineering >Multidass Boosting with Adaptive Group-Based kNN and Its Application in Text Categorization
【24h】

Multidass Boosting with Adaptive Group-Based kNN and Its Application in Text Categorization

机译:基于自适应组的kNN的多输入信号增强及其在文本分类中的应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

AdaBoost is an excellent committee-based tool for classification. However, its effectiveness and efficiency in multiclass categorization face the challenges from methods based on support vector machine (SVM), neural networks (NN), naive Bayes, and κ-nearest neighbor (κNN). This paper uses a novel multi-class AdaBoost algorithm to avoid reducing the multi-class classification problem to multiple two-class classification problems. This novel method is more effective. In addition, it keeps the accuracy advantage of existing AdaBoost. An adaptive group-based κNN method is proposed in this paper to build more accurate weak classifiers and in this way control the number of basis classifiers in an acceptable range. To further enhance the performance, weak classifiers are combined into a strong classifier through a double iterative weighted way and construct an adaptive group-based κNN boosting algorithm (AGκNN-AdaBoost). We implement AGκNN-AdaBoost in a Chinese text categorization system. Experimental results showed that the classification algorithm proposed in this paper has better performance both in precision and recall than many other text categorization methods including traditional AdaBoost. In addition, the processing speed is significantly enhanced than original AdaBoost and many other classic categorization algorithms.
机译:AdaBoost是基于委员会的出色分类工具。然而,其在多类分类中的有效性和效率面临着来自基于支持向量机(SVM),神经网络(NN),朴素贝叶斯和κ最近邻(κNN)的方法的挑战。本文使用一种新颖的多类AdaBoost算法来避免将多类分类问题简化为多个两类分类问题。这种新颖的方法更有效。此外,它保留了现有AdaBoost的准确性优势。本文提出了一种基于群体的自适应κNN方法,以建立更准确的弱分类器,并以此将基本分类器的数量控制在可接受的范围内。为了进一步提高性能,弱分类器通过双重迭代加权方式组合为强分类器,并构建了基于自适应组的κNN增强算法(AGκNN-AdaBoost)。我们在中文文本分类系统中实现AGκNN-AdaBoost。实验结果表明,与传统的AdaBoost等许多文本分类方法相比,本文提出的分类算法在精度和召回率上均具有更好的性能。此外,处理速度比原始的AdaBoost和许多其他经典分类算法显着提高。

著录项

  • 来源
    《Mathematical Problems in Engineering》 |2012年第7期|793490.1-793490.24|共24页
  • 作者单位

    School of Automation, Beijing Institute of Technology, Beijing 100081, China;

    School of Automation, Beijing Institute of Technology, Beijing 100081, China;

    School of Automation, Beijing Institute of Technology, Beijing 100081, China;

    School of Automation, Beijing Institute of Technology, Beijing 100081, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号