首页> 外文会议>Workshop on biomedical natural language processing >Temporal Expression Recognition for Cell Cycle Phase Concepts in Biomedical Literature
【24h】

Temporal Expression Recognition for Cell Cycle Phase Concepts in Biomedical Literature

机译:生物医学文献中细胞周期阶段概念的时间表达识别。

获取原文

摘要

In this paper, we present a system for recognizing temporal expressions related to cell cycle phase (CCP) concepts in biomedical literature. We identified 11 classes of cell cycle related temporal expressions, for which we made extensions to TIMEX3, arranging them in an ontology derived from the Gene Ontology. We annotated 310 abstracts from PubMed. Annotation guidelines were developed, consistent with existing time-related annotation guidelines for TimeML. Two anno-tators participated in the annotation. We achieved an inter-annotator agreement of 0.79 for an exact span match and 0.82 for relaxed constraints. Our approach is a hybrid of machine learning to recognize temporal expressions and a rule-based approach to map them to the ontology. We trained a named entity recognizer using Conditional Random Fields (CRF) models. An off-the-shelf implementation of the linear chain CRF model was used. We obtained an F-score of 0.77 for temporal expression recognition. We achieved 0.79 macro-averagee F-score and 0.78 micro-averaged F-score for mapping to the ontology.
机译:在本文中,我们提出了一种识别生物医学文献中与细胞周期阶段(CCP)概念相关的时间表达的系统。我们确定了11个类别的细胞周期相关时间表达式,我们将扩展到Timex3,以源自基因本体的本体论。我们注释了来自Pubmed的310个摘要。制定注释指南,与Timeml的现有时间相关的注释指南一致。两个Anno-Tators参与了注释。我们实现了一个完全跨度匹配和0.82的注释器协议0.79,为放松的约束。我们的方法是机器学习的混合动力,以识别时间表达式和基于规则的方法来将它们映射到本体。我们使用条件随机字段(CRF)模型训练了一个命名实体识别器。使用了线性链CRF模型的搁板实现。对于时间表达识别,我们获得了0.77的F分。我们达到了0.79宏普通的F分和0.78微平均F分,以便映射到本体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号