首页> 外文会议>European Conference on Artificial Intelligence >Improving Hierarchical Classification with Partial Labels
【24h】

Improving Hierarchical Classification with Partial Labels

机译:用部分标签提高分层分类

获取原文

摘要

In this paper, we address the problem of semi-supervised hierarchical learning when some cases are fully labeled while other cases are only partially labeled, named Hierarchical Partial Labels. Given a label hierarchy, a fully labeled example provides a path from the root node to a leaf node while a partially labeled example only provides a path from the root node to an internal node. We introduce a discriminative learning approach, called Partial HSVM, that incorporates partially labeled information into the hierarchical maximum margin-based learning framework. The partially labeled hierarchical learning problem is formulated as a quadratic optimization that minimizes the empirical risk with L2-norm regularization. We also present an efficient algorithm for the hierarchical classification in the presence of partially labeled information. In our experiments with the WIPO-alpha patent collection, we compare our proposed algorithm with two other baseline approaches: Binary HSVM, a standard approach to hierarchical classification, which builds a binary classifier (SVM) at each node in the hierarchy, and PL-SVM, a flat multiclass classifier which can take advantages of the partial label information. Our empirical results show that Partial HSVM outperforms Binary HSVM and PL-SVM across different performance metrics. The experimental results demonstrate that our proposed algorithm, Partial HSVM, combines the strength of both methods, the Binary HSVM and PL-SVM, since it utilizes both the hierarchical information and the partially labeled examples. In addition, we observe the positive correlation between the labeling effort in obtaining partially labeled data and the improvement in performance.
机译:在本文中,我们解决了一些案例在其他情况下完全标记时的半监督分层学习问题,而其他情况仅被部分标记,则为名为分层部分标签。给定标签层次结构时,完全标记的示例在局部标记的示例仅从根节点提供给内部节点时提供从根节点到叶节点的路径。我们介绍了一种判别的学习方法,称为部分HSVM,该方法将部分标记的信息包含在分层最大边缘的学习框架中。部分标记的分层学习问题被制定为二次优化,最小化L2-Norm正规的经验风险。我们还在存在部分标记的信息存在下为分层分类提供了一种有效的算法。在我们对WIPO-alpha专利集合的实验中,我们将所提出的算法与另外两种基线方法进行比较:二进制HSVM,分层分类的标准方法,其在层次结构中的每个节点处构建二进制分类器(SVM),以及PL- SVM,一个扁平的多字母分类器,可以采用部分标签信息的优势。我们的经验结果表明,部分HSVM在不同性能指标上优于二元HSVM和PL-SVM。实验结果表明,我们所提出的算法部分HSVM组合了两种方法,二元HSVM和PL-SVM的强度,因为它利用了分层信息和部分标记的示例。此外,我们在获得部分标记的数据和改进性能方面,观察标签努力之间的正相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号