首页> 外文期刊>Pattern recognition letters >Optimizing the class information divergence for transductive classification of texts using propagation in bipartite graphs
【24h】

Optimizing the class information divergence for transductive classification of texts using propagation in bipartite graphs

机译:使用二部图中的传播优化类别信息差异以进行文本的归纳分类

获取原文
获取原文并翻译 | 示例

摘要

Transductive classification is an useful way to classify a collection of unlabeled textual documents when only a small fraction of this collection can be manually labeled. Graph-based algorithms have aroused considerable interests in recent years to perform transductive classification since the graph-based representation facilitates label propagation through the graph edges. In a bipartite graph representation, nodes represent objects of two types, here documents and terms, and the edges between documents and terms represent the occurrences of the terms in the documents. In this context, the label propagation is performed from documents to terms and then from terms to documents iteratively. In this paper we propose a new graph-based transductive algorithm that use the bipartite graph structure to associate the available class information of labeled documents and then propagate these class information to assign labels for unlabeled documents. By associating the class information to edges linking documents to terms we guarantee that a single term can propagate different class information to its distinct neighbors. We also demonstrated that the proposed method surpasses the algorithms for transductive classification based on vector space model or graphs when only a small number of labeled documents is available. (C) 2016 Elsevier B.V. All rights reserved.
机译:当只有很少一部分可以手动标记时,转导分类是对未标记文本文档集合进行分类的一种有用方法。近年来,基于图的算法引起了人们的极大兴趣,以进行转导分类,因为基于图的表示法促进了标签通过图边缘的传播。在二部图表示中,节点表示两种类型的对象,此处为文档和术语,文档和术语之间的边表示文档中术语的出现。在这种情况下,标签传播是从文档到术语,然后是从术语到文档进行迭代传播。在本文中,我们提出了一种新的基于图的转换算法,该算法使用二部图结构来关联已标记文档的可用类别信息,然后传播这些类别信息以为未标记文档分配标签。通过将类信息与将文档链接到术语的边相关联,我们保证单个术语可以将不同的类信息传播到其不同的邻居。我们还证明,当只有少量标记的文档可用时,所提出的方法优于基于矢量空间模型或图的转导分类算法。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号