IEEE International Conference on Machine Learning and Applications

Towards Fast and Unified Transfer Learning Architectures for Sequence Labeling

Abstract

Sequence labeling systems have advanced continuously using neural architectures over the past several years. However, these tasks require large sets of annotated data to achieve such performance. In particular, we focus on the Named Entity Recognition (NER) task on clinical notes, one of the most fundamental and critical problems in medical text analysis. Our work centers on effectively adapting these neural architectures to low-resource settings using parameter transfer methods. We complement a standard hierarchical NER model with a general transfer learning framework, the Tunable Transfer Network (TTN), which shares parameters between the source and target tasks, and show scores significantly above the baseline architecture. Our best TTN model achieves a 2-5% improvement over the pre-trained language model BERT as well as its multi-task extension MT-DNN in low-resource settings. However, our proposed sharing scheme requires an exponential search over tied parameter sets to find an optimal configuration. To avoid this exhaustive search, we propose the Dynamic Transfer Networks (DTN), a gated architecture which learns the appropriate parameter sharing scheme between source and target datasets. DTN matches the improvements of the optimized transfer learning framework with just a single training setting, effectively removing the need for an exponential search.
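
The abstract describes DTN only at a high level. As a rough illustration of the gating idea, here is a minimal sketch assuming a PyTorch implementation; the class name GatedSharingLayer and all dimensions are illustrative, not taken from the paper. A learned sigmoid gate interpolates between a transformation shared with the source task and a target-task-specific one, so the degree of parameter sharing is learned in a single training run instead of found by an exponential search over tied parameter sets.

```python
# A minimal sketch of gated parameter sharing (assumptions: PyTorch;
# GatedSharingLayer is a hypothetical name, not from the paper).
import torch
import torch.nn as nn


class GatedSharingLayer(nn.Module):
    """Interpolates between a shared and a task-specific transformation.

    Instead of hand-picking which layers to tie between the source and
    target tasks (the exponential search the TTN requires), a learned
    sigmoid gate decides, per dimension, how much of the shared
    representation to use for the target task.
    """

    def __init__(self, hidden_dim: int):
        super().__init__()
        self.shared = nn.Linear(hidden_dim, hidden_dim)   # tied across tasks
        self.private = nn.Linear(hidden_dim, hidden_dim)  # target task only
        self.gate = nn.Linear(hidden_dim, hidden_dim)     # produces mixing weights

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        g = torch.sigmoid(self.gate(x))  # values in (0, 1), learned from data
        return g * self.shared(x) + (1 - g) * self.private(x)


# Usage: stack such layers over token representations for the target NER task.
layer = GatedSharingLayer(hidden_dim=128)
tokens = torch.randn(2, 10, 128)  # (batch, seq_len, hidden)
out = layer(tokens)
print(out.shape)                  # torch.Size([2, 10, 128])
```

Because the gate is differentiable, the sharing scheme is optimized jointly with the rest of the model, which is what lets a single training setting replace the search over configurations.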
