首页> 外文会议>International Conference on String Processing and Information Retrieval >TreeBoost.MH: A Boosting Algorithm for Multi-label Hierarchical Text Categorization
【24h】

TreeBoost.MH: A Boosting Algorithm for Multi-label Hierarchical Text Categorization

机译:TreeBoost.mh:多标签分层文本分类的促进算法

获取原文

摘要

In this paper we propose TreeBoost.MH, an algorithm for multi-label Hierarchical Text Categorization (HTC) consisting of a hierarchical variant of AdaBoost.MH. TreeBoost.MH embodies several intuitions that had arisen before within HTC: e.g. the intuitions that both feature selection and the selection of negative training examples should be performed "locally", i.e. by paying attention to the topology of the classification scheme. It also embodies the novel intuition that the weight distribution that boosting algorithms update at every boosting round should likewise be updated "locally". We present the results of experimenting TreeBoost.MH on two HTC benchmarks, and discuss analytically its computational cost.
机译:在本文中,我们提出了TreeBoost.mh,该算法,用于多标签分层分类(HTC),由Adaboost.mh的分层变体组成。 TreeBoost.mh体现了几种在HTC中出现的直觉:例如,特征选择和否定训练示例的选择应该在“本地”,即通过关注分类方案的拓扑来执行。它还体现了一种新颖的直觉,即在每一个升压轮子上提升算法更新的重量分布同样应该在“本地”中更新。我们介绍了在两个HTC基准测试中进行了树木博托马特马谟的结果,并讨论了其计算成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号