首页> 外文会议>International Workshop on Language Technology Platforms;Language Resources and Evaluation Conference >Transfer learning applied to text classification in Spanish radiological reports
【24h】

Transfer learning applied to text classification in Spanish radiological reports

机译:转移学习应用于西班牙放射学报告中的文本分类

获取原文

摘要

Pre-trained text encoders have rapidly advanced the state-of-the-art on many Natural Language Processing tasks. This paper presents the use of transfer learning methods applied to the automatic detection of codes in radiological reports in Spanish. Assigning codes to a clinical document is a popular task in NLP and in the biomedical domain. These codes can be of two types: standard classifications (e.g. ICD-10) or specific to each clinic or hospital. In this study we show a system using specific radiology clinic codes. The dataset is composed of 208,167 radiology reports labeled with 89 different codes. The corpus has been evaluated with three methods using the BERT model applied to Spanish: Multilingual BERT, BETO and XLM. The results are interesting obtaining 70% of Fl -score with a pre-trained multilingual model.
机译:经过预训练的文本编码器已在许多自然语言处理任务上迅速发展了最新技术。本文介绍了转移学习方法在西班牙语放射报告中自动检测代码中的应用。将代码分配给临床文档是NLP和生物医学领域的一项常见任务。这些代码可以分为两种类型:标准分类(例如ICD-10)或特定于每个诊所或医院的代码。在这项研究中,我们展示了使用特定放射科诊所代码的系统。该数据集由208,167份放射学报告组成,这些报告标记有89种不同的代码。使用适用于西班牙语的BERT模型用三种方法对语料库进行了评估:多语言BERT,BETO和XLM。使用预训练的多语言模型获得F1分数的70%的结果很有趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号