Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection

机译：生物医学技术中的实体提取：一种评估基于PSO的特征选择词嵌入功能的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text mining has drawn significant atten tion in recent past due to the rapid growth in biomedical and clinical records. Entity extraction is one of the fundamental com ponents for biomedical text mining. In this paper, we propose a novel approach of feature selection for entity extraction that exploits the concept of deep learning and Particle Swarm Optimization (PSO). The system utilizes word embedding features along with several other features extracted by studying the properties of the datasets. We obtain an interesting observation that compact word embedding features as de termined by PSO are more effective com pared to the entire word embedding fea ture set for entity extraction. The pro posed system is evaluated on three bench mark biomedical datasets such as GENIA, GENETAG and AiMed. The effective ness of the proposed approach is evident with significant performance gains over the baseline models as well as the other ex isting systems. We observe improvements of 7.86%, 5.27% and 7.25% F-measure points over the baseline models for GE NIA, GENETAG, and AiMed dataset re spectively.

机译：由于生物医学和临床记录的快速增长，文本挖掘在过去的过去造成了显着的效力。实体提取是生物医学文本挖掘的基本组合。在本文中，我们提出了一种新的特征选择方法，用于利用深度学习和粒子群优化概念（PSO）的概念。该系统利用单词嵌入功能以及通过研究数据集的属性提取的几个其他功能。我们获得了一个有趣的观察，即PSO所定位的Comply Word嵌入功能是更有效的COM削减了嵌入用于实体提取的FEA TURE设置的整个单词。 Pro姿势系统在三个台面标记生物医学数据集（如Genia，Genetag和瞄准）上进行评估。所提出的方法的有效性是显而易见的，在基线模型以及其他EX的系统上具有显着性能。我们观察GE NIA，GENETAG和AIMASET的基线模型中的7.86％，5.27％和7.25％的F测量点的改进。

著录项

来源
《Conference of the European Chapter of the Association for Computational Linguistics》|2017年|xxxviii p. 643-1280|共12页
会议地点
作者
Shweta Yadav; Asif Ekbal; Sriparna Saha; Pushpak Bhattacharyya;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Feature selection for entity extraction from multiple biomedical corpora: A PSO-based approach [J] . Shweta Yadav, Asif Ekbal, Sriparna Saha Soft computing: A fusion of foundations, methodologies and applications . 2018,第20期

机译：来自多种生物医学的实体提取的特征选择：基于PSO的方法
2. Information theoretic-PSO-based feature selection: an application in biomedical entity extraction [J] . Yadav Shweta, Ekbal Asif, Saha Sriparna Knowledge and information systems . 2019,第3期

机译：基于信息理论上的PSO的特征选择：生物医学实体提取的应用
3. Face recognition using transform domain feature extraction and PSO-based feature selection [J] . Krisshna N. L. Ajit, Deepak V. Kadetotad, Manikantan K., Applied Soft Computing . 2014,第Null期

机译：使用变换域特征提取和基于PSO的特征选择进行人脸识别
4. Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection [C] . Shweta Yadav, Asif Ekbal, Sriparna Saha, Conference of the European Chapter of the Association for Computational Linguistics . 2017

机译：生物医学语料库中的实体提取：一种基于PSO的特征选择评估词嵌入特征的方法
5. Advancing Biomedical Named Entity Recognition with Multivariate Feature Selection and Semantically Motivated Features. [D] . Leaman, James Robert, Jr. 2013

机译：具有多元特征选择和语义动机特征的生物医学命名实体识别。
6. Evaluating Word Representation Features in Biomedical Named Entity Recognition Tasks [O] . Buzhou Tang, Hongxin Cao, Xiaolong Wang, -1

机译：在生物医学命名实体识别任务中评估单词表示功能
7. Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection [O] . Shweta Yadav, Asif Ekbal, Sriparna Saha, 2017

机译：生物医学技术中的实体提取：一种评估基于PSO的特征选择词嵌入功能的方法

Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection

摘要

著录项

相似文献

相关主题

期刊订阅