首页> 外文会议>Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Active~2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation
【24h】

Active~2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

机译:活跃〜2学习:积极减少序列标记和机器翻译的主动学习方法中的冗余

获取原文

摘要

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may potentially select similar examples, all of which may not contribute significantly to the learning process. Our proposed approach. Active~2 Learning (A~2L), actively adapts to the deep learning model being trained to eliminate such redundant examples chosen by an AL strategy. We show that A~2L is widely applicable by using it in conjunction with several different AL strategies and NLP tasks. We empirically demonstrate that the proposed approach is further able to reduce the data requirements of state-of-the-art AL strategies by ≈ 3 - 25% on an absolute scale on multiple NLP tasks while achieving the same performance with virtually no additional computation overhead.
机译:虽然深度学习是自然语言处理(NLP)问题的强大工具,但对这些问题的成功解决方案严重依赖于大量注释样本。然而,手动注释数据昂贵且耗时。主动学习(AL)策略通过迭代地选择基于其训练模型的估计效用来迭代选择手动注释的少数示例来减少大量标记数据的需求。在本文中,我们认为,由于Al策略独立选择示例,因此它们可能潜在地选择类似的示例,所有这些例子都可能对学习过程没有显着贡献。我们提出的方法。有效〜2学习(A〜2L),积极适应培训的深度学习模型,以消除由AL策略选择的这种多余示例。我们表明,通过使用它与几种不同的AL策略和NLP任务结合使用它是广泛适用的。我们经验证明,所提出的方法进一步能够通过在多个NLP任务上的绝对级别上通过绝对级别来降低最先进的AL策略的数据要求,同时实现与几乎没有额外的计算开销的相同性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号