Reinforced Training Data Selection for Domain Adaptation

机译：增强训练数据选择以适应领域

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Supervised models suffer from the problem of domain shifting where distribution mismatch in the data across domains greatly affect model performance. To solve the problem, training data selection (TDS) has been proven to be a prospective solution for domain adaptation in leveraging appropriate data. However, conventional TDS methods normally requires a predefined threshold which is neither easy to set nor can be applied across tasks, and models are trained separately with the TDS process. To make TDS self-adapted to data and task, and to combine it with model training, in this paper, we propose a reinforcement learning (RL) framework that synchronously searches for training instances relevant to the target domain and learns better representations for them. A selection distribution generator (SDG) is designed to perform the selection and is updated according to the rewards computed from the selected data, where a predictor is included in the framework to ensure a task-specific model can be trained on the selected data and provides feedback to rewards. Experimental results from part-of-speech tagging, dependency parsing, and sentiment analysis, as well as ablation studies, illustrate that the proposed framework is not only effective in data selection and representation, but also generalized to accommodate different NLP tasks.

机译：监督模型会遭受域移动的问题，其中跨域的数据分布不匹配会极大地影响模型性能。为了解决该问题，训练数据选择（TDS）已被证明是利用适当数据进行领域自适应的前瞻性解决方案。但是，常规的TDS方法通常需要预定义的阈值，该阈值既不容易设置，也不能跨任务应用，并且使用TDS流程分别训练模型。为了使TDS适应数据和任务，并将其与模型训练相结合，我们提出了一种强化学习（RL）框架，该框架可以同步搜索与目标领域相关的训练实例，并为其学习更好的表示形式。选择分布生成器（SDG）设计为执行选择，并根据从所选数据计算出的奖励进行更新，其中在框架中包括预测变量，以确保可以在所选数据上训练特定于任务的模型并提供反馈奖励。词性标注，依赖性分析和情感分析以及消融研究的实验结果表明，提出的框架不仅可以有效地进行数据选择和表示，而且可以通用化以适应不同的NLP任务。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2019年|1957-1968|共12页
会议地点
作者
Miaofeng Liu; Yan Song; Hongbin Zou; Tong Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Training data independent image registration using generative adversarial networks and domain adaptation [J] . Pattern Recognition: The Journal of the Pattern Recognition Society . 2020,第期

机译：使用生成的对冲网络和域适应培训数据独立图像注册
2. Domain Adaptation for Crisis Data Using Correlation Alignment and Self-Training [J] . Hongmin Li, Oleksandra Sopova, Doina Caragea, International journal of information systems for crisis response and management . 2018,第4期

机译：使用关联对齐和自训练对危机数据进行域自适应
3. Effect of Data Imbalance on Unsupervised Domain Adaptation of Part-of-Speech Tagging and Pivot Selection Strategies [J] . Xia Cui, Frans Coenen, Danushka Bollegala JMLR: Workshop and Conference Proceedings . 2017,第1期

机译：数据不平衡对词性标记和数据透视选择策略的无监督域适应的影响
4. Reinforced Training Data Selection for Domain Adaptation [C] . Miaofeng Liu, Yan Song, Hongbin Zou, Annual meeting of the Association for Computational Linguistics . 2019

机译：增强域适应的训练数据选择
5. Domain adaptation with minimal training [D] . Kundu, Gourab 2014

机译：只需最少的培训即可适应领域
6. A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation [O] . Longyue Wang, Derek F. Wong, Lidia S. Chao, -1

机译：SMT域适配的数据选择标准的系统比较
7. Domain adaptation and training data acquisition in wide-coverage word sense disambiguation and its application to information retrieval [O] . ZHONG ZHI 2012

机译：广泛的词义消歧领域适应和训练数据获取及其在信息检索中的应用

Reinforced Training Data Selection for Domain Adaptation

摘要

著录项

相似文献

相关主题

期刊订阅