...
首页> 外文期刊>Bioinformatics >MedScan, a natural language processing engine for MEDLINE abstracts
【24h】

MedScan, a natural language processing engine for MEDLINE abstracts

机译:MedScan,MEDLINE摘要的自然语言处理引擎

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: The importance of extracting biomedical information from scientific publications is well recognized. A number of information extraction systems for the biomedical domain have been reported, but none of them have become widely used in practical applications. Most proposals to date make rather simplistic assumptions about the syntactic aspect of natural language. There is an urgent need for a system that has broad coverage and performs well in real-text applications. Results: We present a general biomedical domain-oriented NLP engine called MedScan that efficiently processes sentences from MEDLINE abstracts and produces a set of regularized logical structures representing the meaning of each sentence. The engine utilizes a specially developed context-free grammar and lexicon. Preliminary evaluation of the system's performance, accuracy, and coverage exhibited encouraging results. Further approaches for increasing the coverage and reducing parsing ambiguity of the engine, as well as its application for information extraction are discussed.
机译:动机:从科学出版物中提取生物医学信息的重要性已得到公认。已经报道了许多用于生物医学领域的信息提取系统,但是没有一个在实际应用中被广泛使用。迄今为止,大多数提议都对自然语言的句法方面做出了相当简单的假设。迫切需要一种具有广泛覆盖范围并且在实文本应用程序中性能良好的系统。结果:我们提出了一个通用的面向生物医学领域的NLP引擎,称为MedScan,该引擎可以有效地处理MEDLINE摘要中的句子,并生成代表每个句子含义的一组规则化逻辑结构。该引擎利用了专门开发的无上下文语法和词典。对系统性能,准确性和覆盖范围的初步评估显示出令人鼓舞的结果。讨论了增加引擎覆盖率和减少解析歧义的其他方法,以及其在信息提取中的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号