【24h】

TCBPLK: A NEW METHOD OF TEXT CATEGORIZATION

机译:TCBPLK:文本分类的新方法

获取原文

摘要

This paper presents a new text categorization method based on P-L theory and Kohonen network, which called TCBPLK method.The Kohonen network is applied to realizing text categorization, which has a defect of too slowly speed of training.To text vector of high dimension, the defect is more obvious.Even the result of text categorization can not be acquired.The new method establishes vector space model of term weight by the theory of P-L, which enhances the function of the words from the viewpoint of categorization effect, and decreases the dimension of vector through eliminating redundant features.Experimental results confirm that TCBPLK method decreases the number of vector, and enhances the generalization and precision of text categorization.
机译:本文提出了一种基于PL理论和Kohonen网络的文本分类新方法,称为TCBPLK方法.Kohonen网络用于实现文本分类,存在训练速度太慢的缺点。该方法利用PL理论建立了术语权重的矢量空间模型,从分类效果的角度增强了词的功能,减小了维数。实验结果证明,TCBPLK方法减少了向量的数量,提高了文本分类的通用性和准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号