首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification
【24h】

Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification

机译:半监督分类的自适应特征选择和特征融合

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Labeling of data is often difficult, expensive, and time consuming since efforts of experienced human annotators are required, and often we have large number of samples and noisy data. Co-training is a practical and powerful semi-supervised learning method as it yields high classification accuracy with a training data set containing only a small set of labeled data. For successful co-training performance, two important conditions need to be satisfied for the features: diversity and sufficiency. In this paper, we propose a novel mutual information based approach inspired by the idea of dependent component analysis to achieve feature splits that are maximally independent between-subsets (diverse) or within-subsets (sufficient). In addition, we demonstrate the application of the method to a real world problem, classification of laser tread mapping tire data. We introduce several features that are designed to highlight physical characteristics of the tire data, as well as local or global descriptors, such as histograms, gradients, or representations in other domains. Results from both simulations and tire image classification confirm that co-training with the proposed feature set and feature splits consistently yields higher accuracy than supervised classification, when using only a small set of labeled training data is available. The proposed method presents a very promising complement to time consuming and subjective expert labeling of data, reducing expert efforts to a minimum. Further results show that by using a probabilistic multi-layer perceptron classifier as the base learner in co-training, our method leads to very meaningful continuous measures for the progression of irregular wear on tire surface.
机译:由于需要经验丰富的人类注释者的努力,数据标记通常很困难,昂贵且耗时,并且我们经常有大量的样本和嘈杂的数据。协同训练是一种实用且功能强大的半监督学习方法,因为它使用仅包含少量标记数据的训练数据集来产生高分类精度。为了成功地进行联合培训,必须满足两个重要条件:多样性和充分性。在本文中,我们提出了一种新颖的基于互信息的方法,该方法受依赖组件分析的思想启发,以实现最大程度独立于子集之间(多样化)或子集内(足够)的特征分割。此外,我们演示了该方法在现实问题中的应用,即激光胎面贴图轮胎数据的分类。我们介绍了一些旨在突出轮胎数据的物理特征的功能,以及局部或全局描述符,例如直方图,坡度或其他域中的表示形式。来自模拟和轮胎图像分类的结果均证实,与仅使用一小组标记的训练数据时相比,与拟议的特征集和特征分割的共同训练比监督分类产生的准确性更高。所提出的方法为耗时和主观的专家数据标记提供了非常有前途的补充,从而将专家的工作量降到最低。进一步的结果表明,通过使用概率多层感知器分类器作为协同训练的基础学习者,我们的方法可为轮胎表面不规则磨损的进展提供非常有意义的连续测量方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号