首页> 外文会议>International conference on neural information processing >Maximum-Margin Framework for Training Data Synchronization in Large-Scale Hierarchical Classification
【24h】

Maximum-Margin Framework for Training Data Synchronization in Large-Scale Hierarchical Classification

机译:大规模分层分类中用于训练数据同步的最大利润框架

获取原文

摘要

In the context of supervised learning, the training data for large-scale hierarchical classification consist of (ⅰ) a set of input-output pairs, and (ⅱ) a hierarchy structure defining parent-child relation among class labels. It is often the case that the hierarchy structure given a-priori is not optimal for achieving high classification accuracy. This is especially true for web-taxonomies such as Yahoo! directory which consist of tens of thousand of classes. Furthermore, an important goal of hierarchy design is to render better navigability and browsing. In this work, we propose a maximum-margin framework for automatically adapting the given hierarchy by using the set of input-output pairs to yield a new hierarchy. The proposed method is not only theoretically justified but also provides a more principled approach for hierarchy flattening techniques proposed earlier, which are ad-hoc and empirical in nature. The empirical results on publicly available large-scale datasets demonstrate that classification with new hierarchy leads to better or comparable generalization performance than the hierarchy flattening techniques.
机译:在监督学习的情况下,用于大规模层次分类的训练数据由(ⅰ)一组输入输出对和(ⅱ)定义类标签之间的父子关系的层次结构组成。通常情况下,给定先验的层次结构对于实现高分类精度而言并不是最佳的。对于Web分类法(例如Yahoo!)尤其如此。目录包含数万个类。此外,层次结构设计的重要目标是提供更好的可导航性和浏览性。在这项工作中,我们提出了一个最大利润框架,该框架通过使用一组输入输出对来自动适应给定的层次结构以产生新的层次结构。所提出的方法不仅在理论上是合理的,而且还为较早提出的层次化扁平化技术提供了更原则性的方法,这些技术本质上是临时的和经验性的。可公开获得的大规模数据集的经验结果表明,与层次结构平坦化技术相比,使用新层次结构进行分类可以带来更好或更可比的泛化性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号