Active Learning without Knowing Individual Instance Labels: A Pairwise Label Homogeneity Query Approach

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Active Learning without Knowing Individual Instance Labels: A Pairwise Label Homogeneity Query Approach

【24h】

Active Learning without Knowing Individual Instance Labels: A Pairwise Label Homogeneity Query Approach

机译：不知道单个实例标签的主动学习：成对标签同质性查询方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional active learning methods require the labeler to provide a class label for each queried instance. The labelers are normally highly skilled domain experts to ensure the correctness of the provided labels, which in turn results in expensive labeling cost. To reduce labeling cost, an alternative solution is to allow nonexpert labelers to carry out the labeling task without explicitly telling the class label of each queried instance. In this paper, we propose a new active learning paradigm, in which a nonexpert labeler is only asked “whether a pair of instances belong to the same class”, namely, a pairwise label homogeneity. Under such circumstances, our active learning goal is twofold: (1) decide which pair of instances should be selected for query, and (2) how to make use of the pairwise homogeneity information to improve the active learner. To achieve the goal, we propose a “Pairwise Query on Max-flow Paths” strategy to query pairwise label homogeneity from a nonexpert labeler, whose query results are further used to dynamically update a Min-cut model (to differentiate instances in different classes). In addition, a “Confidence-based Data Selection” measure is used to evaluate data utility based on the Min-cut model’s prediction results. The selected instances, with inferred class labels, are included into the labeled set to form a closed-loop active learning process. Experimental results and comparisons with state-of-the-art methods demonstrate that our new active learning paradigm can result in good performance with nonexpert labelers.

机译：传统的主动学习方法要求贴标器为每个查询的实例提供一个类标签。贴标人员通常是技术娴熟的领域专家，以确保所提供标签的正确性，从而导致昂贵的贴标成本。为了降低标记成本，另一种解决方案是允许非专业标记人员执行标记任务，而无需明确告知每个查询实例的类标记。在本文中，我们提出了一种新的主动学习范式，其中仅询问非专家标记者“一对实例是否属于同一类”，即成对标记同质性。在这种情况下，我们的主动学习目标是双重的：（1）确定应该选择哪一对实例进行查询；（2）如何利用成对的同质性信息来改善主动学习者。为了实现该目标，我们提出了一种“最大流路径上的成对查询”策略，以从非专家标签器中查询成对标签的同质性，其查询结果还用于动态更新最小切割模型（以区分不同类别的实例）。此外，“最小化数据选择”措施用于根据最小切割模型的预测结果评估数据的实用性。带有推断类标签的所选实例将包含在标签集中以形成闭环主动学习过程。实验结果和与最先进方法的比较表明，我们的新的主动学习范例可以在非专家标签机上产生良好的性能。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2014年第4期|808-822|共15页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Active learning; pairwise label homogeneity; weak labeling;

机译：主动学习;成对标签同质;弱标签;

相似文献

外文文献
中文文献
专利

1. An improved multi-instance multi-label learning algorithm based on representative instances selection and label correlations [J] . Chanjuan Liu, Tongtong Chen, Hailin Zou, International Journal of Grid and Utility Computing . 2018,第3期

机译：基于代表性实例选择和标签关联的改进多实例多标签学习算法
2. Multi-label learning based on label-specific features and local pairwise label correlation [J] . Weng Wei, Lin Yaojin, Wu Shunxiang, Neurocomputing . 2018,第jana17期

机译：基于标签特定功能和局部成对标签相关性的多标签学习
3. A New multi-instance multi-label learning approach for image and text classification [J] . Yan Kaobi, Li Zhixin, Zhang Canlong Multimedia Tools and Applications . 2016,第13期

机译：用于图像和文本分类的多实例多标签学习新方法
4. Do They Belong to the Same Class? Active Learning by Querying Pairwise Label Homogeneity [C] . Yifan Fu, Bin Li, Xingquan Zhu, ACM international conference on information and knowledge management . 2011

机译：他们属于同一个阶级吗？通过查询成对标签同质性进行主动学习
5. Synthesis of Isotopically Labeled Co-Enzyme to Probe the Active Site of Tryptophan synthase/ New Synthetic Approach to Tetrahydrocannabinol Analogs [D] . Bastin, Baback 2015

机译：同位素标记辅酶的合成以探测色氨酸合酶的活性位点/四氢大麻酚类似物的新合成方法
6. Multi-Instance Multilabel Learning with Weak-Label for Predicting Protein Function in Electricigens [O] . Jian-Sheng Wu, Hai-Feng Hu, Shan-Cheng Yan, -1

机译：带有弱标签的多实例多标签学习用于预测脑电蛋白的功能
7. Who should label what? instance allocation in multiple expert active learning [O] . Byron C. Wallace, Kevin Small, Carla E. Brodley, 2011

机译：谁应该标注什么？多专家主动学习中的实例分配

Active Learning without Knowing Individual Instance Labels: A Pairwise Label Homogeneity Query Approach

摘要

著录项

相似文献

相关主题

期刊订阅