首页> 外文会议>International Symposium on Intelligence Computation and Applications >A New Approach of Feature Selection for Chinese Web Page Categorization
【24h】

A New Approach of Feature Selection for Chinese Web Page Categorization

机译:一种新的中文网页分类功能选择方法

获取原文

摘要

Feature selection is a key step of web page categorization. It can influence the accuracy of categorization directly as well as the efficiency. This paper proposes a new approach of feature selection based on Mutual Information algorithm. It brings in feature whose Mutual Information is negative and emphasizes the occurrence probabilities of features in different categories. Moreover, it makes some improvements on the web page preprocessing to reserve some useful features. The experiment shows that the new feature selection method improves the accuracy of categorization effectively.
机译:特征选择是网页分类的关键步骤。它可以直接影响分类的准确性以及效率。本文提出了一种基于互信息算法的特征选择的新方法。它带来了互信息是否定的特征,并强调不同类别的特征的发生概率。此外,它对网页预处理进行了一些改进以保留一些有用的功能。实验表明,新的特征选择方法有效地提高了分类的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号