首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Reinforced Training Data Selection for Domain Adaptation
【24h】

Reinforced Training Data Selection for Domain Adaptation

机译:增强训练数据选择以适应领域

获取原文

摘要

Supervised models suffer from the problem of domain shifting where distribution mismatch in the data across domains greatly affect model performance. To solve the problem, training data selection (TDS) has been proven to be a prospective solution for domain adaptation in leveraging appropriate data. However, conventional TDS methods normally requires a predefined threshold which is neither easy to set nor can be applied across tasks, and models are trained separately with the TDS process. To make TDS self-adapted to data and task, and to combine it with model training, in this paper, we propose a reinforcement learning (RL) framework that synchronously searches for training instances relevant to the target domain and learns better representations for them. A selection distribution generator (SDG) is designed to perform the selection and is updated according to the rewards computed from the selected data, where a predictor is included in the framework to ensure a task-specific model can be trained on the selected data and provides feedback to rewards. Experimental results from part-of-speech tagging, dependency parsing, and sentiment analysis, as well as ablation studies, illustrate that the proposed framework is not only effective in data selection and representation, but also generalized to accommodate different NLP tasks.
机译:监督模型会遭受域移动的问题,其中跨域的数据分布不匹配会极大地影响模型性能。为了解决该问题,训练数据选择(TDS)已被证明是利用适当数据进行领域自适应的前瞻性解决方案。但是,常规的TDS方法通常需要预定义的阈值,该阈值既不容易设置,也不能跨任务应用,并且使用TDS流程分别训练模型。为了使TDS适应数据和任务,并将其与模型训练相结合,我们提出了一种强化学习(RL)框架,该框架可以同步搜索与目标领域相关的训练实例,并为其学习更好的表示形式。选择分布生成器(SDG)设计为执行选择,并根据从所选数据计算出的奖励进行更新,其中在框架中包括预测变量,以确保可以在所选数据上训练特定于任务的模型并提供反馈奖励。词性标注,依赖性分析和情感分析以及消融研究的实验结果表明,提出的框架不仅可以有效地进行数据选择和表示,而且可以通用化以适应不同的NLP任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号