Exploring Word Embedding for Drug Name Recognition

机译：探索词嵌入以实现药物名称识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a machine learning-based approach that uses word embedding features to recognize drug names from biomedical texts. As a starting point, we developed a baseline system based on Conditional Random Field (CRF) trained with standard features used in current Named Entity Recognition (NER) systems. Then, the system was extended to incorporate new features, such as word vectors and word clusters generated by the Word2Vec tool and a lexicon feature from the DINTO ontology. We trained the Word2vec tool over two different corpus: Wikipedia and MedLine. Our main goal is to study the effectiveness of using word embeddings as features to improve performance on our baseline system, as well as to analyze whether the DINTO ontology could be a valuable complementary data source integrated in a machine learning NER system. To evaluate our approach and compare it with previous work, we conducted a series of experiments on the dataset of SemEval-2013 Task 9.1 Drug Name Recognition.

机译：本文介绍了一种基于机器学习的方法，该方法使用单词嵌入功能从生物医学文本中识别药物名称。首先，我们开发了基于条件随机场（CRF）的基线系统，该条件系统经过训练，并使用了当前命名实体识别（NER）系统中使用的标准功能。然后，该系统进行了扩展，以合并新功能，例如Word2Vec工具生成的单词向量和单词簇以及DINTO本体中的词典功能。我们在两个不同的语料库上训练了Word2vec工具：Wikipedia和MedLine。我们的主要目标是研究使用词嵌入作为提高基线系统性能的功能的有效性，并分析DINTO本体是否可以作为集成在机器学习NER系统中的有价值的补充数据源。为了评估我们的方法并将其与以前的工作进行比较，我们对SemEval-2013 Task 9.1药物名称识别的数据集进行了一系列实验。

著录项

来源
《6th Workshop on health text mining and information analysis》|2015年|64-72|共9页
会议地点 Lisbon(PT)
作者
Isabel Segura-Bedmar; Victor Suarez-Paniagua; Paloma Martinez;
展开▼
作者单位

Computer Science Department University Carlos Ⅲ of Madrid, Spain;

Computer Science Department University Carlos Ⅲ of Madrid, Spain;

Computer Science Department University Carlos Ⅲ of Madrid, Spain;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-26 14:23:28

相似文献

外文文献
中文文献
专利

1. Effects of Semantic Features on Machine Learning-Based Drug Name Recognition Systems: Word Embeddings vs. Manually Constructed Dictionaries [J] . Buzhou Tang, Qingcai Chen, Shengyu Liu, Information . 2015,第4期

机译：语义特征对基于机器学习的药物名称识别系统的影响：单词嵌入与手动构建词典
2. The activation of embedded words in spoken word recognition [J] . Zhang Xujin, Samuel Arthur G. Journal of memory and language . 2015,第Null期

机译：语音识别中嵌入单词的激活
3. Embedded words in visual word recognition: Does the left hemisphere see the rain in brain? [J] . McCormick S.F., Davis C.J., Brysbaert M. Journal of experimental psychology. Learning, memory, and cognition . 2010,第5期

机译：视觉单词识别中的嵌入单词：左半球是否看到脑部下雨？
4. Exploring Word Embedding for Drug Name Recognition [C] . Isabel Segura-Bedmar, Victor Suarez-Paniagua, Paloma Martinez Workshop on health text mining and information analysis . 2015

机译：探索嵌入药物名称识别的词
5. Specificity of the b Test, Dot Counting Test, Rey 15-Item Test Plus Recognition, and Rey Word Recognition Test in Monolingual Spanish Speakers Embedded Measure of Effort [D] . Robles, Luz Alehida 2013

机译：b语言测试，点计数测试，Rey 15项测试加识别和Rey单词识别测试在说西班牙语的嵌入式工作量中的特异性
6. The Activation of Embedded Words in Spoken Word Recognition [O] . Xujin Zhang, Arthur G. Samuel -1

机译：语音识别中嵌入单词的激活
7. Effects of Semantic Features on Machine Learning-Based Drug Name Recognition Systems: Word Embeddings vs. Manually Constructed Dictionaries [O] . Shengyu Liu, Buzhou Tang, Qingcai Chen, 2015

机译：语义特征对基于机器学习的药品名称识别系统的影响：词语嵌入与手工构建词典

Exploring Word Embedding for Drug Name Recognition

摘要

著录项

相似文献

相关主题

期刊订阅