首页> 外文会议>Digital Libraries: Universal and Ubiquitous Access to Information >Hierarchical Classification of Web Pages Using Support Vector Machine
【24h】

Hierarchical Classification of Web Pages Using Support Vector Machine

机译:支持向量机的网页分层分类

获取原文
获取原文并翻译 | 示例

摘要

In this paper, a novel method for web page hierarchical classification is addressed. In our approach, SVM is used as the basic algorithm to separate any two sub-categories under the same parent node. In order to alleviate the ill shift of SVM classifier caused by imbalanced training data, we try to combine the original SVM classifier with BEV algorithm to create classifier called VOTEM. Then, a web document is assigned to a sub-category based on voting from all category-to-category classifiers. This hierarchical classification algorithm starts its work from the top of the hierarchical tree downward recursively until it triggers a stop condition or reaches the leaf nodes. And our experiment reveals that proposed algorithm obtains better results.
机译:本文提出了一种新的网页层次分类方法。在我们的方法中,将SVM用作基本算法,以将同一父节点下的任意两个子类别分开。为了减轻训练数据不均衡导致的SVM分类器的不适,我们尝试将原始SVM分类器与BEV算法结合起来,创建称为VOTEM的分类器。然后,基于来自所有类别到类别分类器的投票,将网络文档分配给子类别。该层次分类算法从层次树的顶部开始递归地开始其工作,直到触发停止条件或到达叶节点为止。实验表明,该算法取得了较好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号