...
首页> 外文期刊>Journal of Intelligent Learning Systems and Applications >Improved Term Weighting Technique for Automatic Web Page Classification
【24h】

Improved Term Weighting Technique for Automatic Web Page Classification

机译:用于自动网页分类的改进的术语加权技术

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Automatic web page classification has become inevitable for web directories due to the multitude of web pages in the World Wide Web. In this paper an improved Term Weighting technique is proposed for automatic and effective classification of web pages. The web documents are represented as set of features. The proposed method selects and extracts the most prominent features reducing the high dimensionality problem of classifier. The proper selection of features among the large set improves the performance of the classifier. The proposed algorithm is implemented and tested on a benchmarked dataset. The results show the better performance than most of the existing term weighting techniques.
机译:由于万维网中的网页众多,因此对于目录而言,自动网页分类已成为必然。本文提出了一种改进的术语加权技术,可以对网页进行自动有效的分类。 Web文档表示为一组功能。该方法选择并提取了最突出的特征,从而减少了分类器的高维问题。在大集合中正确选择特征可提高分类器的性能。所提出的算法是在基准数据集上实现和测试的。结果显示出比大多数现有术语加权技术更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号