首页> 外文期刊>Journal of biomedical informatics. >A shallow parser based on closed-class words to capture relations in biomedical text.
【24h】

A shallow parser based on closed-class words to capture relations in biomedical text.

机译:基于闭级单词的浅析解析器,以捕获生物医学文本的关系。

获取原文
获取原文并翻译 | 示例
       

摘要

Natural language processing for biomedical text currently focuses mostly on entity and relation extraction. These entities and relations are usually pre-specified entities, e.g., proteins, and pre-specified relations, e.g., inhibit relations. A shallow parser that captures the relations between noun phrases automatically from free text has been developed and evaluated. It uses heuristics and a noun phraser to capture entities of interest in the text. Cascaded finite state automata structure the relations between individual entities. The automata are based on closed-class English words and model generic relations not limited to specific words. The parser also recognizes coordinating conjunctions and captures negation in text, a feature usually ignored by others. Three cancer researchers evaluated 330 relations extracted from 26 abstracts of interest to them. There were 296 relations correctly extracted from the abstracts resulting in 90% precision of the relations and an average of 11 correct relations per abstract.
机译:生物医学文本的自然语言处理目前主要关注实体和关系提取。这些实体和关系通常是预先指定的实体,例如蛋白质和预先指定关系,例如,抑制关系。已经开发并评估了一个浅扫描器,用于自动从自由文本中自动捕获名词短语之间的关系。它使用启发式和名词phraser来捕获文本中感兴趣的实体。级联有限状态自动机结构结构各个实体之间的关系。自动机基于封闭类英语单词和模型泛型关系不限于特定单词。解析器还识别协调连词并在文本中捕获否定,通常由他人忽略的特征。三个癌症研究人员评估了330个关系从26个摘要中提取的关系。从摘要中有296个关系提取了296个关系,导致关系的90%精度,平均每摘要11个正确的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号