首页> 外文会议>2010 Third Pacific-Asia conference on web mining and web-based application. >Multi-label Text Classification with a Robust Label Dependent Representation
【24h】

Multi-label Text Classification with a Robust Label Dependent Representation

机译:具有可靠标签依赖表示的多标签文本分类

获取原文
获取原文并翻译 | 示例

摘要

Automatic text classification is the task of assigning unseen documents to a predefined set of classes or categories. Text Representation for classification have been traditionally approached with tf.idf due to its simplicity and good performance. Multi-label automatic text classification has been traditionally tackled in the literature either by transforming the problem to apply binary techniques or by adapting binary algorithms to work with multiple labels. We present tf.rrfl, a novel text representation for the multilabel classification approach. Our proposal focuses on modifying the data set input to the algorithm, differentiating the input by the label to evaluate. Performance of tf.rrfl was tested with a known benchmark and compared to alternative techniques. The results show improvement compared to alternative approaches in terms of Hamming loss.
机译:自动文本分类是将看不见的文档分配给预定义的一组类或类别的任务。由于tf.idf的简单性和良好的性能,传统上已使用tf.idf进行分类的文本表示。传统上,通过将问题转化为应用二进制技术或通过使二进制算法适用于多个标签来解决多标签自动文本分类问题。我们介绍了tf.rrfl,这是一种用于多标签分类方法的新颖文本表示形式。我们的建议着重于修改算法的数据集输入,通过标签区分输入以进行评估。使用已知基准测试了tf.rrfl的性能,并将其与替代技术进行了比较。结果表明,与汉明损失相比,替代方法有改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号