...
首页> 外文期刊>International journal of machine learning and cybernetics >A comparative study for biomedical named entity recognition
【24h】

A comparative study for biomedical named entity recognition

机译:生物医学命名实体识别的比较研究

获取原文
获取原文并翻译 | 示例
           

摘要

AbstractWith high-throughput technologies applied in biomedical research, the quantity of biomedical literatures grows exponentially. It becomes more and more important to quickly as well as accurately extract knowledge from manuscripts, especially in the era of big data. Named entity recognition (NER), aiming at identifying chunks of text that refers to specific entities, is essentially the initial step for information extraction. In this paper, we will review the three models of biomedical NER and two famous machine learning methods, Hidden Markov Model and Conditional Random Fields, which have been widely applied in bioinformatics. Based on these two methods, six excellent biomedical NER tools are compared in terms of programming language, feature sets, underlying mathematical methods, post-processing techniques and flowcharts. Experimental results of these tools against two widely used corpora, GENETAG and JNLPBA, are conducted. The comparison varies from different entity types to the overall performance. Furthermore, we put forward suggestions about the selection of Bio-NER tools for different applications.
机译: Abstract 随着在生物医学研究中应用的高通量技术,生物医学文献的数量不断增长指数增长。快速而准确地从手稿中提取知识变得越来越重要,尤其是在大数据时代。旨在识别引用特定实体的文本块的命名实体识别(NER)本质上是信息提取的第一步。在本文中,我们将回顾生物医学神经网络的三种模型以及两种著名的机器学习方法,即隐马尔可夫模型和条件随机场,它们已在生物信息学中得到广泛应用。基于这两种方法,从编程语言,功能集,基础数学方法,后处理技术和流程图方面比较了六个出色的生物医学NER工具。这些工具针对两种广泛使用的语料库GENETAG和JNLPBA进行了实验结果。比较从不同的实体类型到整体绩效都不同。此外,我们针对不同应用场合的Bio-NER工具的选择提出了建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号