首页> 外文会议>International Conference on Advances in Recent Technologies in Communication and Computing >Exploiting Negative Categories and Wikipedia Structures for Document Classification
【24h】

Exploiting Negative Categories and Wikipedia Structures for Document Classification

机译:利用否定类别和维基百科结构进行文档分类

获取原文

摘要

This paper explores the effect of profile based method for classification of Wikipedia XML documents. Our approach builds two profiles, exploiting the whole content, Initial Descriptions and links in the Wikipedia documents. For building profiles we use the negative category information which has shown to perform well for classifying unstructured texts. The performance of Cosine and Fractional Similarity metrics is also compared. The use of two classifiers and their weighted average improves the classification performance.
机译:本文探讨了基于概况的方法对维基百科XML文档分类的影响。我们的方法构建了两个配置文件,利用Wikipedia文档中的整个内容,初始描述和链接。对于构建配置文件,我们使用否定的负类信息,该信息已经表现出色,以便对分类非结构化文本进行分类。还比较了余弦和分数相似度量的性能。使用两个分类器及其加权平均值可以提高分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号