首页> 外文期刊>Bioinformatics >ChemSpot: a hybrid system for chemical named entity recognition
【24h】

ChemSpot: a hybrid system for chemical named entity recognition

机译:ChemSpot:用于化学命名实体识别的混合系统

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: The accurate identification of chemicals in text is important for many applications, including computer-assisted reconstruction of metabolic networks or retrieval of information about substances in drug development. But due to the diversity of naming conventions and traditions for such molecules, this task is highly complex and should be supported by computational tools. Results: We present ChemSpot, a named entity recognition (NER) tool for identifying mentions of chemicals in natural language texts, including trivial names, drugs, abbreviations, molecular formulas and International Union of Pure and Applied Chemistry entities. Since the different classes of relevant entities have rather different naming characteristics, ChemSpot uses a hybrid approach combining a Conditional Random Field with a dictionary. It achieves an F-1 measure of 68.1% on the SCAI corpus, outperforming the only other freely available chemical NER tool, OSCAR4, by 10.8 percentage points.
机译:动机:准确识别文本中的化学物质对于许多应用都很重要,包括计算机辅助的代谢网络重构或检索有关药物开发中物质的信息。但是由于此类分子的命名约定和传统的多样性,该任务非常复杂,应由计算工具来支持。结果:我们提供了ChemSpot,这是一种命名实体识别(NER)工具,用于识别自然语言文本中提及的化学物质,包括琐碎的名称,药物,缩写,分子式以及国际纯粹与应用化学联合会。由于不同类别的相关实体具有相当不同的命名特征,因此ChemSpot使用将条件随机字段与字典结合在一起的混合方法。它在SCAI语料库上实现的F-1测度为68.1%,比其他唯一可免费获得的化学NER工具OSCAR4高出10.8个百分点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号