...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >A Bayesian Nonparametric Approach for Multi-label Classification
【24h】

A Bayesian Nonparametric Approach for Multi-label Classification

机译:多标签分类的贝叶斯非参数方法

获取原文
           

摘要

Many real-world applications require multi-label classification where multiple target labels are assigned to each instance. In multi-label classification, there exist the intrinsic correlations between the labels and features. These correlations are beneficial for multi-label classification task since they reflect the coexistence of the input and output spaces that can be exploited for prediction. Traditional classification methods have attempted to reveal these correlations in different ways. However, existing methods demand expensive computation complexity for finding such correlation structures. Furthermore, these approaches can not identify the suitable number of label-feature correlation patterns. In this paper, we propose a Bayesian nonparametric (BNP) framework for multi-label classification that can automatically learn and exploit the unknown number of multi-label correlation. We utilize the recent techniques in stochastic inference to derive the cheap (but efficient) posterior inference algorithm for the model. In addition, our model can naturally exploit the useful information from missing label samples. Furthermore, we extend the model to update parameters in an online fashion that highlights the flexibility of our model against the existing approaches. We compare our method with the state-of-the-art multi-label classification algorithms on real-world datasets using both complete and missing label settings. Our model achieves better classification accuracy while our running time is consistently much faster than the baselines in an order of magnitude.
机译:许多实际应用程序需要多标签分类,其中将多个目标标签分配给每个实例。在多标签分类中,标签和要素之间存在内在的关联。这些相关性对于多标签分类任务很有用,因为它们反映了可用于预测的输入和输出空间的共存。传统的分类方法试图以不同的方式揭示这些相关性。但是,现有方法需要昂贵的计算复杂度才能找到这种相关结构。此外,这些方法无法识别适当数量的标签特征相关模式。在本文中,我们提出了一种用于多标签分类的贝叶斯非参数(BNP)框架,该框架可以自动学习和利用未知数量的多标签相关性。我们利用随机推理中的最新技术来推导该模型的便宜(但有效)的后验算法。此外,我们的模型可以自然地利用缺失标签样本中的有用信息。此外,我们扩展了模型以在线方式更新参数,从而突出了模型相对于现有方法的灵活性。我们将我们的方法与使用完整和缺失标签设置的真实数据集上的最新多标签分类算法进行比较。我们的模型实现了更好的分类准确性,而我们的运行时间始终比基线快一个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号