Active² Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

Abstract

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation based on their estimated utility in training the given model. In this paper, we argue that since AL strategies choose examples independently, they may select similar examples, not all of which contribute significantly to the learning process. Our proposed approach, Active² Learning (A²L), actively adapts to the deep learning model being trained to eliminate such redundant examples chosen by an AL strategy. We show that A²L is widely applicable by using it in conjunction with several different AL strategies and NLP tasks. We empirically demonstrate that the proposed approach further reduces the data requirements of state-of-the-art AL strategies by ≈ 3-25% on an absolute scale on multiple NLP tasks while achieving the same performance with virtually no additional computation overhead.
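The abstract describes the idea only at a high level: an AL strategy ranks examples by estimated utility, and near-duplicates among the chosen examples are then dropped. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes each candidate comes with a utility score and an embedding vector, and it swaps in a plain cosine-similarity threshold as the redundancy test. The names `select_batch`, `cosine`, `budget`, and `sim_threshold` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def select_batch(pool, budget, sim_threshold=0.9):
    """Pick up to `budget` example ids for annotation.

    pool: list of (example_id, utility_score, embedding) tuples.
    Candidates are visited from highest to lowest utility; a candidate
    is skipped when it is too similar to one that was already kept.
    The similarity test is an assumption made for illustration only.
    """
    ranked = sorted(pool, key=lambda item: item[1], reverse=True)
    chosen = []
    for ex_id, score, emb in ranked:
        if len(chosen) == budget:
            break
        if all(cosine(emb, kept) < sim_threshold for _, _, kept in chosen):
            chosen.append((ex_id, score, emb))
    return [ex_id for ex_id, _, _ in chosen]


# Toy pool: s2 is a near-duplicate of s1, so it is skipped in favour of s3.
pool = [
    ("s1", 0.95, [1.0, 0.0]),
    ("s2", 0.93, [0.99, 0.05]),
    ("s3", 0.80, [0.0, 1.0]),
    ("s4", 0.60, [0.7, 0.7]),
]
print(select_batch(pool, budget=2))  # ['s1', 's3']
```

A purely utility-ranked selection would spend the budget of two on s1 and s2. Filtering redundancy lets the second annotation go to s3, which is the kind of saving the abstract refers to.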

Bibliographic details

Source
Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies | 2021 | pp. 1982-1995 | 14 pages
Authors
Rishi Hazra; Parag Dutta; Shubham Gupta; Mohammed Abdul Qaathir; Ambedkar Dukkipati
Original format
PDF

