【24h】

Text Categorization using Feature Projections

机译:使用特征投影的文本分类

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training documents on each feature. The voting for a classification is processed on the basis of individual feature projections. The final classification of test documents is determined by a majority voting from the individual classifications of each feature. Our empirical results show that the proposed approach, Text Categorization using Feature Projections (TCFP), outperforms κ-NN, Rocchio, and Naieve Bayes. Most of all, TCFP is about one hundred times faster than κ-NN. Since TCFP algorithm is very simple, its implementation and training process can be done very easily. For these reasons, TCFP can be a useful classifier in the areas, which need a fast and high-performance text categorization task.
机译:本文提出了一种基于特征投影技术的文本分类新方法。在我们的方法中,训练数据表示为每个功能上训练文档的投影。分类的投票是根据各个要素预测进行的。测试文档的最终分类由每个功能的单独分类中的多数投票决定。我们的经验结果表明,所提出的方法,使用特征投影的文本分类(TCFP)优于κ-NN,Rocchio和Naieve Bayes。最重要的是,TCFP比κ-NN快一百倍。由于TCFP算法非​​常简单,因此其实现和培训过程非常容易。由于这些原因,TCFP可以在需要快速而高性能的文本分类任务的领域中成为有用的分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号