
Selection Strategies for Multi-label Text Categorization



Abstract

In multi-label text categorization, determining the final set of classes that will label a given document is not trivial. It involves first determining whether a class is suitable to be attached to the text and, second, deciding how many classes to assign. Different strategies for determining the size of the final set of assigned labels are studied here. We analyze several classification algorithms along with two main selection strategies: keeping a fixed number of top-ranked labels, or using per-class thresholds. Our experiments show the effects of each approach and the issues to consider when using them.
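
The two selection strategies named in the abstract can be illustrated with a minimal sketch. It assumes a classifier has already produced one score per class for a document; the function names, label names, scores, and threshold values below are hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def select_top_k(scores: Dict[str, float], k: int) -> List[str]:
    """Fixed-size strategy: keep the k highest-scoring labels,
    regardless of how high or low their absolute scores are."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k]


def select_by_threshold(scores: Dict[str, float],
                        thresholds: Dict[str, float],
                        default: float = 0.5) -> List[str]:
    """Per-class threshold strategy: keep every label whose score reaches
    that label's own threshold, so the set size varies per document.
    `default` is an assumed fallback for labels without a tuned threshold."""
    return [label for label, score in scores.items()
            if score >= thresholds.get(label, default)]


# Hypothetical classifier scores for one document.
scores = {"sports": 0.91, "politics": 0.40, "economy": 0.55, "health": 0.05}
# Hypothetical per-class thresholds (in practice tuned on held-out data).
thresholds = {"sports": 0.8, "politics": 0.3, "economy": 0.6, "health": 0.5}

top2 = select_top_k(scores, 2)
by_threshold = select_by_threshold(scores, thresholds)
print(top2)          # ['sports', 'economy']
print(by_threshold)  # ['sports', 'politics']
```

On this toy input the two strategies disagree: the fixed-size rule keeps "economy" because it ranks second, while the threshold rule drops it and keeps "politics", whose lower score still clears its own lower threshold.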
