...
首页> 外文期刊>Knowledge-Based Systems >An effective framework based on local cores for self-labeled semi-supervised classification
【24h】

An effective framework based on local cores for self-labeled semi-supervised classification

机译:基于本地核心的自我标签半监督分类的有效框架

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Semi-supervised self-labeled methods apply unlabeled data to improve the performance of classifiers which are trained by labeled data alone. Nevertheless, applying unlabeled data may deteriorate the prediction accuracy. One of the causes is that there are insufficient labeled data for training an initial classifier in self-labeled methods. However, existing solutions for this problem of lacking sufficient initial labeled data still have technical defects. For example, they fail to deal with non-spherical data and improve insufficient initial labeled data effectively, when initial labeled data are extremely scarce. In this paper, we propose an effective semi-supervised self-labeled framework based on local cores, aiming to solve the problem of lacking adequate initial labeled data in self-labeled methods and overcome existing technical defects above. Main ideas of our framework include two sides: (a) inadequate initial labeled data are improved by adding predicted local cores to them, where local cores are predicted by active labeling or co-labeling; (b) we use any semi-supervised self-labeled method to train a given classifier on improved labeled data and updated unlabeled data. In our framework, local cores roughly reveal the data distribution, which helps the proposed framework work on spherical or non-spherical data sets. In addition, local cores also help our framework improve insufficient initial labeled data effectively, even when initial labeled data are extremely scarce. Experiments show that the proposed framework is compatible with tested self-labeled methods, and can help self-labeled methods train a k nearest neighbor or support vector machine, when initial labeled data are insufficient. (C) 2020 Elsevier B.V. All rights reserved.
机译:半监督自我标记方法应用未标记的数据,以提高由标签数据训练的分类器的性能。然而,应用未标记的数据可能会恶化预测准确性。其中一个原因是,在自标记方法中训练初始分类器的标记数据不足。但是,对于缺乏足够初始标记数据的解决问题的现有解决方案仍然具有技术缺陷。例如,当初始标记数据极为稀缺时,它们未能处理非球面数据并有效地改善初始标记数据不足。在本文中,我们提出了一种基于本地核心的有效的半监督自我标记框架,旨在解决缺乏自我标记方法中足够初始标记数据的问题,并克服上述现有技术缺陷。我们的框架的主要思想包括双方:(a)通过向它们添加预测的本地核心来改善初始标记数据的不足,其中通过主动标记或共同标记预测本地核心; (b)我们使用任何半监督的自我标记方法在改进的标记数据和更新的未标记数据上培训给定分类器。在我们的框架中,本地核心粗略地揭示了数据分布,帮助所提出的框架在球面或非球形数据集上工作。此外,当地核心也有助于我们的框架有效地改善初始标记数据的不足,即使初始标记数据非常稀缺。实验表明,当初始标记数据不足时,所提出的框架兼容测试的自我标记方法,并且可以帮助自标记的方法训练K最近邻居或支持向量机。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号