首页> 外文期刊>Journal of Intelligent Information Systems >Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach
【24h】

Improving large-scale hierarchical classification by rewiring: a data-driven filter based approach

机译:通过重新布线改善大规模分层分类:一种基于数据驱动的过滤器的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Hierarchical Classification (HC) is a supervised learning problem where unlabeled instances are classified into a taxonomy of classes. Several methods that utilize the hierarchical structure have been developed to improve the HC performance. However, in most cases apriori defined hierarchical structure by domain experts is inconsistent; as a consequence performance improvement is not noticeable in comparison to flat classification methods. We propose a scalable data-driven filter based rewiring approach to modify an expert-defined hierarchy. Experimental comparisons of top-down hierarchical classification with our modified hierarchy, on a wide range of datasets shows classification performance improvement over the baseline hierarchy (i.e., defined by expert), clustered hierarchy and flattening based hierarchy modification approaches. In comparison to existing rewiring approaches, our developed method (rewHier) is computationally efficient, enabling it to scale to datasets with large numbers of classes, instances and features. We also show that our modified hierarchy leads to improved classification performance for classes with few training samples in comparison to flat and state-of-the-art hierarchical classification approaches. Source Code: https://cs.gmu.edu/similar to mlbio/TaxMod/
机译:分层分类(HC)是一种有监督的学习问题,其中未标记的实例被分类为类的分类法。已经开发了几种利用分层结构的方法来改善HC性能。但是,在大多数情况下,领域专家先验定义的层次结构是不一致的。因此,与平面分类方法相比,性能改善并不明显。我们提出了一种基于可伸缩数据驱动过滤器的重新布线方法,以修改专家定义的层次结构。在广泛的数据集上对自上而下的分层分类与我们修改后的分层进行的实验比较显示,分类性能在基线分层(即由专家定义),聚类分层和基于扁平化的分层修改方法方面得到了改善。与现有的重新布线方法相比,我们开发的方法(rewHier)计算效率高,可扩展到具有大量类,实例和功能的数据集。我们还表明,与固定的和最新的分层分类方法相比,修改后的分层结构可提高培训样本较少的类的分类性能。源代码:https://cs.gmu.edu/与mlbio / TaxMod /类似

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号