...
首页> 外文期刊>Neurocomputing >Online semi-supervised learning with learning vector quantization
【24h】

Online semi-supervised learning with learning vector quantization

机译:在线半监督学习与学习矢量量化

获取原文
获取原文并翻译 | 示例
           

摘要

Online semi-supervised learning (OSSL) is a learning paradigm simulating human learning, in which the data appear in a sequential manner with a mixture of both labeled and unlabeled samples. Despite the recent advances, there are still many unsolved problems in this area. In this paper, we propose a novel OSSL method based on learning vector quantization (LVQ). LVQ classifiers, which represent the data of each class by a set of prototypes, have found their usage in a wide range of pattern recognition problems and can be naturally adapted to the online scenario by updating the prototypes with stochastic gradient optimization. However, most of the existing LVQ algorithms were designed for supervised classification. To extract useful information from unlabeled data, we propose two simple and computationally efficient methods based on clustering assumption. To be specific, we use the maximum conditional likelihood criterion for updating prototypes when data sample is labeled, and the Gaussian mixture clustering criterion or neural gas clustering criterion for adjusting prototypes when data sample is unlabeled. These two criteria are utilized alternatively according to the availability of label information to make full use of both supervised and unsupervised data to boost the performance. By extensive experiments, we show that the proposed method exhibits higher accuracy compared with the baseline methods and graph-based methods and is much more efficient than graph-based methods in both training and test time. (c) 2020 Elsevier B.V. All rights reserved.
机译:在线半监督学习(OSSL)是模拟人类学习的学习范例,其中数据以连续的方式出现,其中标记和未标记的样品的混合物。尽管最近的进步,但这领域还有许多未解决的问题。在本文中,我们提出了一种基于学习矢量量化(LVQ)的新型OSSL方法。 LVQ分类器代表每组原型的每个类的数据,发现它们在广泛的模式识别问题中使用了它们,并且可以通过更新具有随机梯度优化的原型来自然地适应在线情景。然而,大多数现有的LVQ算法被设计用于监督分类。要从未标记的数据中提取有用信息,我们提出了基于聚类假设的两个简单和计算有效的方法。具体而言,我们使用最大条件似然标准来更新原型当数据样本被标记时,并且在数据样本未标记时调整原型的高斯混合聚类标准或神经气体聚类标准。这两个标准可根据标签信息的可用性使用,以充分利用监督和无监督的数据来提高性能。通过广泛的实验,我们表明,与基于基线方法和基于图形的方法相比,该方法的准确性更高,并且比训练和测试时间的基于图的方法更有效。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2020年第jul25期|467-478|共12页
  • 作者单位

    Chinese Acad Sci Inst Automat Natl Lab Pattern Recognit Beijing Peoples R China|Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing 100049 Peoples R China;

    Chinese Acad Sci Inst Automat Natl Lab Pattern Recognit Beijing Peoples R China|Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing 100049 Peoples R China;

    Chinese Acad Sci Inst Automat Natl Lab Pattern Recognit Beijing Peoples R China;

    Chinese Acad Sci Inst Automat Natl Lab Pattern Recognit Beijing Peoples R China|Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing 100049 Peoples R China|CAS Ctr Excellence Brain Sci & Intelligence Techn Beijing 100190 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Online learning; Semi-supervised classification; Learning vector quantization; Gaussian mixture distribution; Neural gas;

    机译:在线学习;半监督分类;学习矢量量化;高斯混合分布;神经气体;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号