...
首页> 外文期刊>Expert Systems with Application >Multi-label classification using hierarchical embedding
【24h】

Multi-label classification using hierarchical embedding

机译:使用分层嵌入的多标签分类

获取原文
获取原文并翻译 | 示例
           

摘要

Multi-label learning is concerned with the classification of data with multiple class labels. This is in contrast to the traditional classification problem where every data instance has a single label. Multi-label classification (MLC) is a major research area in the machine learning community and finds application in several domains such as computer vision, data mining and text classification. Due to the exponential size of the output space, exploiting intrinsic information in feature and label spaces has been the major thrust of research in recent years and use of parametrization and embedding have been the prime focus in MLC. Most of the existing methods learn a single linear parametrization using the entire training set and hence, fail to capture nonlinear intrinsic information in feature and label spaces. To overcome this, we propose a piecewise-linear embedding which uses maximum margin matrix factorization to model linear parametrization. We hypothesize that feature vectors which conform to similar embedding are similar in some sense. Combining the above concepts, we propose a novel hierarchical matrix factorization method for multi-label classification. Practical multi-label classification problems such as image annotation, text categorization and sentiment analysis can be directly solved by the proposed method. We compare our method with six well-known algorithms on twelve benchmark datasets. Our experimental analysis manifests the superiority of our proposed method over state-of-art algorithm for multi-label learning. (C) 2017 Elsevier Ltd. All rights reserved.
机译:多标签学习与具有多个类别标签的数据分类有关。这与传统的分类问题相反,在传统的分类问题中,每个数据实例都有一个标签。多标签分类(MLC)是机器学习社区中的一个主要研究领域,在计算机视觉,数据挖掘和文本分类等多个领域都有应用。由于输出空间的指数大小,近年来,在特征和标签空间中利用固有信息一直是研究的主要方向,而参数化和嵌入的使用已成为MLC的主要重点。大多数现有方法使用整个训练集学习单个线性参数化,因此无法捕获特征和标签空间中的非线性固有信息。为了克服这个问题,我们提出了一种分段线性嵌入方法,该方法使用最大余量矩阵分解对线性参数化进行建模。我们假设符合相似嵌入的特征向量在某种意义上是相似的。结合以上概念,我们提出了一种新的用于多标签分类的层次矩阵分解方法。所提出的方法可以直接解决实际的多标签分类问题,例如图像标注,文本分类和情感分析。我们将我们的方法与十二种基准数据集上的六种著名算法进行了比较。我们的实验分析表明,我们提出的方法优于用于多标签学习的最新算法。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

  • 来源
    《Expert Systems with Application》 |2018年第1期|263-269|共7页
  • 作者单位

    Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India;

    Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India|Cent Univ Rajasthan, Ajmer, Rajasthan, India;

    Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India;

    Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India;

    Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multi-label learning; Matrix factorization; Label correlation;

    机译:多标签学习;矩阵分解;标签相关;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号