首页> 外文期刊>Image Processing, IEEE Transactions on >Robust and Discriminative Labeling for Multi-Label Active Learning Based on Maximum Correntropy Criterion
【24h】

Robust and Discriminative Labeling for Multi-Label Active Learning Based on Maximum Correntropy Criterion

机译:基于最大熵准则的多标签主动学习的鲁棒和区分性标签

获取原文
获取原文并翻译 | 示例
       

摘要

Multi-label learning draws great interests in many real world applications. It is a highly costly task to assign many labels by the oracle for one instance. Meanwhile, it is also hard to build a good model without diagnosing discriminative labels. Can we reduce the label costs and improve the ability to train a good model for multi-label learning simultaneously? Active learning addresses the less training samples problem by querying the most valuable samples to achieve a better performance with little costs. In multi-label active learning, some researches have been done for querying the relevant labels with less training samples or querying all labels without diagnosing the discriminative information. They all cannot effectively handle the outlier labels for the measurement of uncertainty. Since maximum correntropy criterion (MCC) provides a robust analysis for outliers in many machine learning and data mining algorithms, in this paper, we derive a robust multi-label active learning algorithm based on an MCC by merging uncertainty and representativeness, and propose an efficient alternating optimization method to solve it. With MCC, our method can eliminate the influence of outlier labels that are not discriminative to measure the uncertainty. To make further improvement on the ability of information measurement, we merge uncertainty and representativeness with the prediction labels of unknown data. It cannot only enhance the uncertainty but also improve the similarity measurement of multi-label data with labels information. Experiments on benchmark multi-label data sets have shown a superior performance than the state-of-the-art methods.
机译:多标签学习在许多实际应用中引起了极大的兴趣。由oracle为一个实例分配许多标签是一项成本很高的任务。同时,不诊断歧视性标签也很难建立一个好的模型。我们是否可以降低标签成本并提高同时训练多标签学习模型的能力?主动学习通过查询最有价值的样本以较少的成本获得更好的性能,从而解决了训练样本较少的问题。在多标签主动学习中,已经进行了一些研究,以较少的训练样本来查询相关标签或查询所有标签而不诊断歧视性信息。他们都不能有效地处理用于不确定性测量的异常值标签。由于最大熵准则(MCC)为许多机器学习和数据挖掘算法中的异常值提供了可靠的分析,因此在本文中,我们通过融合不确定性和代表性来得出基于MCC的可靠的多标签主动学习算法,并提出了一种有效的方法交替优化方法来解决它。使用MCC,我们的方法可以消除对不确定性没有歧视性的异常标签的影响。为了进一步提高信息测量的能力,我们将不确定性和代表性与未知数据的预测标签合并在一起。它不仅增加了不确定性,而且改善了多标签数据与标签信息的相似度测量。在基准多标签数据集上进行的实验显示出比最新方法更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号