International Conference on Pattern Recognition

Rethinking deep active learning: Using unlabeled data at model training



Abstract

Active learning typically focuses on training a model on a few labeled examples alone, while unlabeled ones are only used for acquisition. In this work we depart from this setting by using both labeled and unlabeled data during model training across active learning cycles. We do so by using unsupervised feature learning at the beginning of the active learning pipeline and semi-supervised learning at every active learning cycle, on all available data. The former has not been investigated before in active learning, while the study of the latter in the context of deep learning is scarce, and recent findings are not conclusive with respect to its benefit. Our idea is orthogonal to acquisition strategies in that it uses more data, much like ensemble methods use more models. By systematically evaluating a number of popular acquisition strategies and datasets, we find that the use of unlabeled data during model training brings a spectacular accuracy improvement in image classification, compared to the differences between acquisition strategies. We thus explore smaller label budgets, even one label per class.
