【24h】

Learning Recursive Patterns for Biomedical Information Extraction

机译:学习递归模式的生物医学信息提取

获取原文
获取原文并翻译 | 示例

摘要

Information in text form remains a greatly unexploited source of biological information. Information Extraction (IE) techniques are necessary to map this information into structured representations that allow facts relating domain-relevant entities to be automatically recognized. In biomedical IE tasks, extracting patterns that model implicit relations among entities is particularly important since biological systems intrinsically involve interactions among several entities. In this paper, we resort to an Inductive Logic Programming (ILP) approach for the discovery of mutual recursive patterns from text. Mutual recursion allows dependencies among entities to be explored in data and extraction models to be applied in a context-sensitive mode. In particular, IE models are discovered in form of classification rules encoding the conditions to fill a pre-defined information template. An application to a real-world dataset composed by publications selected to support biologists in the task of automatic annotation of a genomic database is reported.
机译:文本形式的信息仍然是生物信息的未开发来源。信息提取(IE)技术是将此信息映射到结构化表示形式中所必需的,该结构化表示形式可以自动识别与域相关实体有关的事实。在生物医学IE任务中,提取模型化实体之间隐式关系的模式尤为重要,因为生物系统本质上涉及多个实体之间的相互作用。在本文中,我们采用归纳逻辑编程(ILP)方法从文本中发现相互递归模式。相互递归允许在数据中探索实体之间的依赖性,并以上下文相关模式应用提取模型。特别是,以分类规则的形式发现IE模型,该规则对条件进行编码以填充预定义的信息模板。报告了一种由选定的出版物组成的对现实世界数据集的应用,这些出版物被选为支持生物学家完成基因组数据库的自动注释任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号