Recognizing Biomedical Named Entities Based on the Sentence Vector/Twin Word Embeddings Conditioned Bidirectional LSTM

机译：基于条件向量双向LSTM的句子向量/双词嵌入识别生物医学命名实体

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As a fundamental step in biomedical information extraction tasks, biomedical named entity recognition remains challenging. In recent years, the neural network has been applied on the entity recognition to avoid the complex hand-designed features, which are derived from various linguistic analyses. However, performance of the conventional neural network systems is always limited to exploiting long range dependencies in sentences. In this paper, we mainly adopt the bidirectional recurrent neural network with LSTM unit to identify biomedical entities, in which the twin word embeddings and sentence vector are added to rich input information. Therefore, the complex feature extraction can be skipped. In the testing phase, Viterbi algorithm is also used to filter the illogical label sequences. The experimental results conducted on the BioCreative Ⅱ GM corpus show that our system can achieve an F-score of 88.61 %, which outperforms CRF models using the complex hand-designed features and is 6.74 % higher than RNNs.

机译：作为生物医学信息提取任务的基本步骤，生物医学命名实体识别仍然具有挑战性。近年来，神经网络已经应用于实体识别以避免复杂的手工设计功能，这些功能来自各种语言分析。然而，传统的神经网络系统的性能总是限于利用句子中的长距离依赖性。在本文中，我们主要通过LSTM单元采用双向反复性神经网络来识别生物医学实体，其中将双词嵌入和句子向量添加到丰富的输入信息。因此，可以跳过复杂的特征提取。在测试阶段，Viterbi算法还用于过滤不合逻辑的标签序列。对生物重建ⅡM血管语料库进行的实验结果表明，我们的系统可以达到88.61％的F分，这优于CRF模型，使用复杂的手动设计的功能，比RNN高6.74％。

著录项

来源
《China national conference on computational linguistics;International symposium on natural language processing based on naturally annotated big data》|2016年|165-176|共12页
会议地点
作者
Lishuang Li; Liuke Jin; Yuxin Jiang; Degen Huang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
LSTM; Twin word embeddings; Sentence vector; Viterbi algorithm;

机译：LSTM;双字嵌入;句子向量;维特比算法;

相似文献

外文文献
中文文献
专利

1. A Bidirectional LSTM Approach with Word Embeddings for Sentence Boundary Detection [J] . Chenglin Xu, Lei Xie, Xiong Xiao Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：带有词嵌入的双向LSTM方法用于句子边界检测
2. D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information [J] . Thanh Hai Dang, Hoang-Quynh Le, Nguyen Trang M., Bioinformatics . 2018,第20期

机译：D3ner：使用CRF-Bilstm的生物医学命名实体识别改进了各种语言信息的微调嵌入
3. Biomedical named entity recognition based on Glove-BLSTM-CRF model [J] . Ning Gelin, Bai Yunli Journal of Computational Methods in Sciences and Engineering . 2021,第1期

机译：基于Glove-BLSTM-CRF模型的生物医学命名实体识别
4. Recognizing Biomedical Named Entities Based on the Sentence Vector/Twin Word Embeddings Conditioned Bidirectional LSTM [C] . Lishuang Li, Liuke Jin, Yuxin Jiang, China National Conference on Computational Linguistics . 2016

机译：基于句子向量/双词嵌入的生物医学命名实体嵌入条件双向LSTM
5. Recognizing named entities in biomedical texts [D] . Gu, Baohua. 2008

机译：识别生物医学文本中的命名实体
6. SBLC: a hybrid model for disease named entity recognition based on semantic bidirectional LSTMs and conditional random fields [O] . Kai Xu, Zhanfan Zhou, Tao Gong, 2018

机译：SBLC：基于语义双向LSTM和条件随机场的疾病命名实体识别混合模型
7. Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition [O] . Zenan Zhai, Dat Quoc Nguyen, Karin Verspoor 2018

机译：比较CNN和LSTM字符级嵌入在Bilstm-CRF模型中的化学和疾病名为实体识别

Recognizing Biomedical Named Entities Based on the Sentence Vector/Twin Word Embeddings Conditioned Bidirectional LSTM

摘要

著录项

相似文献

相关主题

期刊订阅