...
首页> 外文期刊>Expert Systems with Application >Regression with re-labeling for noisy data
【24h】

Regression with re-labeling for noisy data

机译:回归并重新标记嘈杂的数据

获取原文
获取原文并翻译 | 示例

摘要

Active learning, which focuses on building an accurate prediction model with a reduced cost by actively querying which instances should be labeled for training, has been successfully employed in several real-world applications involving expensive labeling costs. Although most existing active learning strategies have focused on labeling unlabeled instances, it has been shown that improving the quality of previously annotated labels is also important when the annotator produces noisy labels. In this study, we propose a novel active learning framework for regression, which is effective for the scenarios with noisy annotators, by providing a new sampling strategy named exploration-refinement (ER) sampling. The ER sampling performs two main steps: exploration and refinement. The exploration step involves finding unlabeled instances to be labeled, and the refinement step seeks to improve the accuracy of already labeled instances. The experimental results on several benchmark datasets demonstrate the effectiveness of the ER sampling with statistical significance. (C) 2018 Elsevier Ltd. All rights reserved.
机译:主动学习侧重于通过主动查询应标记哪些实例进行训练来以降低的成本构建准确的预测模型,这种主动学习已成功应用于涉及昂贵标记成本的几种实际应用中。尽管大多数现有的主动学习策略都集中在标记未标记的实例上,但是已经表明,当注释者产生嘈杂的标签时,提高先前注释标签的质量也很重要。在这项研究中,我们提出了一种新的主​​动回归学习框架,该框架通过提供一种称为探究细化(ER)采样的新采样策略,对于带有嘈杂注释符的场景非常有效。 ER采样执行两个主要步骤:探索和完善。探索步骤涉及找到要标记的未标记实例,而精炼步骤则试图提高已标记实例的准确性。在几个基准数据集上的实验结果证明了ER抽样的有效性,具有统计意义。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号