首页> 外文会议>International Conference on Neural Information Processing >An improved KNN learning based Korean text classifier with heuristic information
【24h】

An improved KNN learning based Korean text classifier with heuristic information

机译:具有启发式信息的基于knn学习的韩语文本分类器

获取原文

摘要

Automatic text categorization is a problem of assigning predefined categories to free text documents based on the likelihood suggested by a training set of labelled texts. kNN learning based text classifier is a well known statistical approach and its algorithm is quite simple. While the method has been applied to many systems and shown relatively good performance, a through evaluation of the method has rarely been done. There are some parameters which play important roles in the performance of the method, decision function, k value of kNN, and size of feature set. This paper focuses on an improving method for a kNN learning based Korean text classifier by using heuristic information found experimentally. Our results show that kNN method with carefully chosen parameters is very significant in improving the performance and decreasing the size of feature set.
机译:自动文本分类是将预定义类别为自由文本文档分配给免费文本文档的问题,这是由标记文本的训练集的似天之行。基于KNN学习的文本分类器是一种众所周知的统计方法,其算法非常简单。虽然该方法已经应用于许多系统并显示出相对良好的性能,但是通过对该方法的评估很少完成。存在一些参数,在方法,决策功能,KNN的k值和特征集的大小的性能方面发挥重要作用。本文通过使用实验发现的启发式信息,侧重于基于KNN学习的韩国文本分类器的改进方法。我们的结果表明,具有精心挑选的参数的KNN方法在提高性能和降低功能集的大小方面非常显着。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号