Named Entity Recognition in Spanish Biomedical Literature: Short Review and Bert Model

机译：西班牙生物医学文献中的命名实体识别：简短回顾和伯特模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Entity Recognition (NER) is the first step for knowledge acquisition when we deal with an unknown corpus of texts. Having received these entities, we have an opportunity to form parameters space and to solve problems of text mining as concept normalization, speech recognition, etc. The recent advances in NER are related to the technology of contextualized word embeddings, which transforms text to the form being effective for Deep Learning. In the paper, we show how NER model detects pharmacological substances, compounds, and proteins in the dataset obtained from the Spanish Clinical Case Corpus (SPACCC). To achieve this goal, we train from scratch the BERT language representation model and fine-tune it for our problem. As it is expected, this model shows better results than the NER model trained over the standard word embeddings. We further conduct an error analysis showing the origins of models' errors and proposing strategies to further improve the model's quality.

机译：当我们处理未知的文本语料库时，实体识别（NER）是知识获取的第一步。收到这些实体后，我们就有机会形成参数空间并解决文本挖掘的问题，例如概念归一化，语音识别等。NER的最新进展与上下文化词嵌入技术有关，后者将文本转换为形式对深度学习有效。在本文中，我们展示了NER模型如何检测从西班牙临床案例语料库（SPACCC）获得的数据集中的药理物质，化合物和蛋白质。为了实现这个目标，我们从头开始训练BERT语言表示模型，并针对我们的问题进行微调。不出所料，该模型显示出比通过标准单词嵌入训练的NER模型更好的结果。我们进一步进行错误分析，以显示模型错误的根源，并提出进一步提高模型质量的策略。

著录项

来源
《Conference of Open Innovations Association》|2019年|1-7|共7页
会议地点
作者
Liliya Akhtyamova;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
data mining; error analysis; knowledge acquisition; learning (artificial intelligence); medical information systems; natural language processing; speech recognition; text analysis;

机译：数据挖掘;错误分析;知识获取;学习（人工智能）;医疗信息系统;自然语言处理;语音识别;文本分析;

相似文献

外文文献
中文文献
专利

1. ABioNER: A BERT-Based Model for Arabic Biomedical Named-Entity Recognition [J] . Nada Boudjellal, Huaping Zhang, Asif Khan, Complexity . 2021,第a期

机译：abioLer：一种基于BERT的阿拉伯生物医学名称实体识别模型
2. Chinese named entity recognition model based on BERT [J] . Hongshuai Liu, Ge Jun, Yuanyuan Zheng MATEC Web of Conferences . 2021,第a期

机译：基于伯特的中国名称实体识别模型
3. Disease named entity recognition from biomedical literature using a novel convolutional neural network [J] . Zhehuan Zhao, Zhihao Yang, Ling Luo, BMC Medical Genomics . 2017,第5期

机译：使用新型卷积神经网络从生物医学文献中将疾病命名为实体识别
4. Knowledge-Based Approach for Named Entity Recognition in Biomedical Literature: A Use Case in Biomedical Software Identification [C] . Muhammad Amith, Yaoyun Zhang, Hua Xu, International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems . 2017

机译：基于知识的生物医学文献中命名实体识别方法：生物医学软件识别中的用例
5. Unsupervised Biomedical Named Entity Recognition [D] . Ghiasvand, Omid. 2017

机译：无监督的生物医学命名实体识别
6. Long short-term memory RNN for biomedical named entity recognition [O] . Chen Lyu, Bo Chen, Yafeng Ren, 2017

机译：长短期记忆RNN用于生物医学命名实体识别
7. Biomedical named entity recognition using BERT in the machine reading comprehension framework [O] . Cong Sun, Zhihao Yang, Lei Wang, 2021

机译：生物医学命名实体识别在机器阅读理解框架中使用BERT

Named Entity Recognition in Spanish Biomedical Literature: Short Review and Bert Model

摘要

著录项

相似文献

相关主题

期刊订阅