PANLP at MEDIQA 2019: Pre-trained Language Models, Transfer Learning and Knowledge Distillation

SIGBioMed Workshop on Biomedical Natural Language Processing; Annual Meeting of the Association for Computational Linguistics


Abstract

This paper describes the models designated for the MEDIQA 2019 shared tasks by team PANLP. We take advantage of recent advances in pre-trained bidirectional transformer language models such as BERT (Devlin et al., 2018) and MT-DNN (Liu et al., 2019b), and find that pre-trained language models can significantly outperform traditional deep learning models. We also experiment with transfer learning from the NLI task to the RQE task, which proves useful in improving the results of fine-tuning MT-DNN large. A knowledge distillation process is implemented to distill the knowledge contained in a set of models into a single model, whose performance turns out to be comparable with that obtained by the ensemble of that set of models. Finally, for the test submissions, model ensembling and a re-ranking process are applied to boost performance. Our models participated in all three tasks, ranking 1st in the RQE task and 2nd in both the NLI and QA tasks.
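
As a rough illustration of the knowledge-distillation step described above (transferring the predictions of an ensemble of fine-tuned models into a single student model), the following is a minimal PyTorch-style sketch. The Hinton-style soft-target objective, the temperature, the mixing weight alpha, and the simple averaging of teacher outputs are assumptions made for illustration, not details reported in the paper.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits_list, labels,
                      temperature=2.0, alpha=0.5):
    # Hypothetical hyperparameters: `temperature` softens the distributions,
    # `alpha` balances the soft (teacher) loss against the hard-label loss.

    # Average the tempered probability distributions of the teacher ensemble.
    teacher_probs = torch.stack(
        [F.softmax(t / temperature, dim=-1) for t in teacher_logits_list]
    ).mean(dim=0)

    # KL divergence between the student's tempered distribution and the
    # averaged teacher distribution, scaled by T^2 as in Hinton et al. (2015).
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        teacher_probs,
        reduction="batchmean",
    ) * (temperature ** 2)

    # Standard cross-entropy against the gold labels.
    hard_loss = F.cross_entropy(student_logits, labels)

    return alpha * soft_loss + (1.0 - alpha) * hard_loss

In this sketch the student would be trained on the combined loss while the teacher models are kept frozen; the paper reports that the distilled single model performs comparably to the ensemble of its teachers.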
