【24h】

On Applying Dimension Reduction for Multi-labeled Problems

机译:关于降维在多标签问题中的应用

获取原文
获取原文并翻译 | 示例

摘要

Traditional classification problem assumes that a data sample belongs to one class among the predefined classes. On the other hand, in a multi-labeled problem such as text categorization, data samples can belong to multiple classes and the task is to output a set of class labels associated with new unseen data sample. As common in text categorization problem, learning a classifier in a high dimensional space can be difficult, known as the curse of dimensionality. It has been shown that performing dimension reduction as a preprocessing step can improve classification performances greatly. Especially, Linear discriminant analysis (LDA) is one of the most popular dimension reduction methods, which is optimized for classification tasks. However, in applying LDA for a multi-labeled problem some ambiguities and difficulties can arise. In this paper, we study on applying LDA for a multi-labeled problem and analyze how an objective function of LDA can be interpreted in multi-labeled setting. We also propose a LDA algorithm which is effective in a multi-labeled problem. Experimental results demonstrate that by considering multi-labeled structures LDA can achieve computational efficiency and also improve classification performances greatly.
机译:传统的分类问题假设数据样本属于预定义类别中的一个类别。另一方面,在诸如文本分类之类的多标签问题中,数据样本可以属于多个类别,并且任务是输出与新的未见数据样本相关联的一组类别标签。作为文本分类问题中的常见问题,在高维空间中学习分类器可能很困难,这被称为维数诅咒。已经表明,执行降维作为预处理步骤可以大大提高分类性能。特别是,线性判别分析(LDA)是最流行的降维方法之一,它针对分类任务进行了优化。但是,在将LDA用于多标签问题时,可能会产生一些歧义和困难。在本文中,我们研究了将LDA应用于多标签问题,并分析了如何在多标签环境中解释LDA的目标函数。我们还提出了一种在多标签问题中有效的LDA算法。实验结果表明,通过考虑多标签结构,LDA可以实现计算效率,并且可以大大提高分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号