首页> 外文会议>IEEE International Conference on Innovations in Information , Embedded and Communication Systems >Ensemble learning approach in improved K Nearest Neighbor algorithm for Text categorization
【24h】

Ensemble learning approach in improved K Nearest Neighbor algorithm for Text categorization

机译:改进的K最近邻算法在文本分类中的集成学习方法

获取原文

摘要

Due to the tremendous growth of digital content in World Wide Web (WWW), Text categorization has become an important tool to manage and organize text related data. This paper proposes an Ensemble Learning approach in Improved K Nearest Neighbor algorithm for Text Categorization (EINNTC), which consists of single pass clustering, Ensemble learning and KNN algorithm. The EINNTC method provides solution to traditional KNN classifier issues, by reducing the huge text similarity computation complexity, avoids an impact of noisy training sample, and expediting the process of finding K nearest neighbors. The experiments were carried out with standard benchmark Reuters dataset, and their empirical results shows that the proposed method outperforms the SVM and KNN classifiers.
机译:由于万维网(WWW)中数字内容的迅猛增长,文本分类已成为管理和组织文本相关数据的重要工具。提出了一种改进的K最近邻文本分类算法(EINNTC)中的集成学习方法,该方法由单遍聚类,集成学习和KNN算法组成。 EINNTC方法通过减少巨大的文本相似度计算复杂度,避免了嘈杂的训练样本的影响并加快了查找K个最近邻居的过程,为传统的KNN分类器问题提供了解决方案。实验是使用标准基准路透数据集进行的,其经验结果表明,所提出的方法优于SVM和KNN分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号