Venue: IEEE International Conference on Machine Learning and Applications
Towards Fast and Unified Transfer Learning Architectures for Sequence Labeling



Abstract

Sequence labeling systems have advanced continuously using neural architectures over the past several years. However, these systems require large sets of annotated data to achieve such performance. In particular, we focus on the Named Entity Recognition (NER) task on clinical notes, which is one of the most fundamental and critical problems for medical text analysis. Our work centers on effectively adapting these neural architectures to low-resource settings using parameter transfer methods. We complement a standard hierarchical NER model with a general transfer learning framework, the Tunable Transfer Network (TTN), which shares parameters between the source and target tasks, and demonstrate scores significantly above the baseline architecture. Our best TTN model achieves a 2-5% improvement over the pre-trained language model BERT as well as its multi-task extension MT-DNN in low-resource settings. However, our proposed sharing scheme requires an exponential search over tied parameter sets to find an optimal configuration. To avoid this exhaustive search, we propose the Dynamic Transfer Network (DTN), a gated architecture that learns the appropriate parameter-sharing scheme between the source and target datasets. DTN matches the improvements of the optimized transfer learning framework with just a single training run, effectively removing the need for an exponential search.
