
Automated extraction of bio-entity relationships from literature


Abstract

The invention provides automated, standardized, and accurate extraction of relationships within text. Extracting such relationships automatically allows the information to be stored in structured form, so that it can be retrieved easily and accurately when needed. The stored information can be used to build online search engines for highly specific and accurate information retrieval. Generally, according to the invention, relationships are extracted from raw text using natural language processing (NLP) and graph-theoretic algorithms. Examples of such textual relationships include, but are not limited to, biological relationships between biological terms such as proteins, genes, pathways, diseases, and drugs. The methodology is also able to recognize negative dependencies in context, match patterns, and provide a shortest path between related words.
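The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it names: treat a sentence's dependency parse as a graph, find the shortest path between two entity mentions, and check whether a negation attaches to a word on that path. The toy sentence, the entity names `GeneA` and `ProteinB`, the hand-written dependency triples, and every function name are assumptions made for illustration; they are not taken from the patent.

```python
from collections import deque

# Hypothetical, hand-written dependency parse of the toy sentence
# "GeneA does not activate ProteinB" as (head, dependent, label) triples.
# A real pipeline would obtain these from a dependency parser.
EDGES = [
    ("activate", "GeneA", "nsubj"),
    ("activate", "ProteinB", "dobj"),
    ("activate", "does", "aux"),
    ("activate", "not", "neg"),
]


def build_graph(edges):
    """Return an undirected adjacency map over the tokens."""
    graph = {}
    for head, dep, _label in edges:
        graph.setdefault(head, set()).add(dep)
        graph.setdefault(dep, set()).add(head)
    return graph


def shortest_path(graph, start, goal):
    """Breadth-first search; returns the token path or None if unreachable."""
    if start not in graph or goal not in graph:
        return None
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for nxt in sorted(graph[node]):  # sorted for deterministic order
            if nxt not in parents:
                parents[nxt] = node
                queue.append(nxt)
    return None


def is_negated(edges, path):
    """True if any token on the path is the head of a 'neg' dependency."""
    on_path = set(path)
    return any(label == "neg" and head in on_path
               for head, _dep, label in edges)


def extract_relation(edges, entity_a, entity_b):
    """Return the connecting path and a negation flag, or None."""
    path = shortest_path(build_graph(edges), entity_a, entity_b)
    if path is None:
        return None
    return {"path": path, "negated": is_negated(edges, path)}


result = extract_relation(EDGES, "GeneA", "ProteinB")
print(result)
```

In this sketch the word in the middle of the path (`activate`) stands in for the relationship, and the negation flag records that the sentence denies it. Breadth-first search is used because all edges have equal weight; a weighted variant would need a different shortest-path algorithm.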

Bibliographic details

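  • Publication number: US9542528B2
  • Patent type:
  • Publication date: 2017-01-10
  • Original format: PDF
  • Application number: US201414534777
  • Inventor: JINFENG ZHANG
  • Filing date: 2014-11-06
  • Classification: G06F19/24; G06F17/27; G06F19/28
  • Country: US
  • Date indexed: 2022-08-21 13:41:11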
