...
首页> 外文期刊>International journal of software science and computational intelligence >Machine Learning Approach for Multi-Layered Detection of Chemical Named Entities in Text
【24h】

Machine Learning Approach for Multi-Layered Detection of Chemical Named Entities in Text

机译:文本中化学命名实体的多层检测的机器学习方法

获取原文
获取原文并翻译 | 示例
           

摘要

Identification of chemical named entities in text and subsequent linkage of information to biological events is of immense value to fulfill the knowledge needs of pharmaceutical and chemical R&D. A significant amount of investigation has been carried out since a decade for identifying chemical named entities at morphological level. However, a barrier still remains in terms of value proposition to scientists at chemistry level. Therefore, the work described here aims to circumvent the information barrier by adaptation of a Conditional Random Fields-based approach for identifying chemical named entities at various levels namely generic chemical level, morphological level, and chemistry level. Substantial effort has been invested on generation of suitable multi-level annotated corpora. Recommended machine learning practices such as active learning-based training corpus generation and feature optimization have been systematically performed. Evaluation of system performance and benchmarking against the other state-of-the-approaches showed improved results.
机译:在文本中识别化学命名实体以及随后将信息链接到生物事件具有巨大价值,可以满足制药和化学研发的知识需求。自从十年以来,已经进行了大量的研究以鉴定形态学上的化学命名实体。但是,在化学领域科学家的价值主张方面仍然存在障碍。因此,本文所述的工作旨在通过采用基于条件随机场的方法来规避信息障碍,该方法用于在各种级别(即通用化学级别,形态学级别和化学级别)上识别化学命名实体。已投入大量精力来生成合适的多层注释语料库。系统地执行了推荐的机器学习实践,例如基于主动学习的训练语料库生成和功能优化。对系统性能的评估和对其他方法的基准测试显示出改进的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号