...
首页> 外文期刊>BMC Medical Informatics and Decision Making >Semantic biomedical resource discovery: a Natural Language Processing framework
【24h】

Semantic biomedical resource discovery: a Natural Language Processing framework

机译:语义生物医学资源发现:自然语言处理框架

获取原文
           

摘要

Background A plethora of publicly available biomedical resources do currently exist and are constantly increasing at a fast rate. In parallel, specialized repositories are been developed, indexing numerous clinical and biomedical tools. The main drawback of such repositories is the difficulty in locating appropriate resources for a clinical or biomedical decision task, especially for non-Information Technology expert users. In parallel, although NLP research in the clinical domain has been active since the 1960s, progress in the development of NLP applications has been slow and lags behind progress in the general NLP domain. The aim of the present study is to investigate the use of semantics for biomedical resources annotation with domain specific ontologies and exploit Natural Language Processing methods in empowering the non-Information Technology expert users to efficiently search for biomedical resources using natural language. Methods A Natural Language Processing engine which can “translate” free text into targeted queries, automatically transforming a clinical research question into a request description that contains only terms of ontologies, has been implemented. The implementation is based on information extraction techniques for text in natural language, guided by integrated ontologies. Furthermore, knowledge from robust text mining methods has been incorporated to map descriptions into suitable domain ontologies in order to ensure that the biomedical resources descriptions are domain oriented and enhance the accuracy of services discovery. The framework is freely available as a web application at ( http://calchas.ics.forth.gr/ ). Results For our experiments, a range of clinical questions were established based on descriptions of clinical trials from the ClinicalTrials.gov registry as well as recommendations from clinicians. Domain experts manually identified the available tools in a tools repository which are suitable for addressing the clinical questions at hand, either individually or as a set of tools forming a computational pipeline. The results were compared with those obtained from an automated discovery of candidate biomedical tools. For the evaluation of the results, precision and recall measurements were used. Our results indicate that the proposed framework has a high precision and low recall, implying that the system returns essentially more relevant results than irrelevant. Conclusions There are adequate biomedical ontologies already available, sufficiency of existing NLP tools and quality of biomedical annotation systems for the implementation of a biomedical resources discovery framework, based on the semantic annotation of resources and the use on NLP techniques. The results of the present study demonstrate the clinical utility of the application of the proposed framework which aims to bridge the gap between clinical question in natural language and efficient dynamic biomedical resources discovery.
机译:背景技术当前确实存在着大量的公共可用生物医学资源,并且它们以快速的速度不断增加。同时,还开发了专门的存储库,索引了许多临床和生物医学工具。这种存储库的主要缺点是难以为临床或生物医学决策任务找到合适的资源,特别是对于非信息技术专家用户而言。同时,尽管自1960年代以来在临床领域进行NLP研究一直很活跃,但是NLP应用程序的开发进展缓慢,并且落后于一般NLP领域的进展。本研究的目的是研究语义在具有特定领域本体的生物医学资源注释中的使用,并利用自然语言处理方法授权非信息技术专家用户使用自然语言有效地搜索生物医学资源。方法已经实现了一种自然语言处理引擎,该引擎可以将自由文本“翻译”为目标查询,将临床研究问题自动转换为仅包含本体术语的请求描述。该实现基于集成本体论指导的自然语言文本信息提取技术。此外,已经结合了来自可靠文本挖掘方法的知识,以将描述映射到合适的领域本体中,以确保生物医学资源描述是面向领域的,并增强了服务发现的准确性。该框架可作为Web应用程序免费获得,网址为(http://calchas.ics.forth.gr/)。结果对于我们的实验,根据ClinicalTrials.gov注册中心对临床试验的描述以及临床医生的建议,建立了一系列临床问题。领域专家在工具库中手动识别了可用于解决当前临床问题的可用工具,这些工具既可以单独使用,也可以作为一组构成计算管线的工具来解决。将结果与从自动发现候选生物医学工具中获得的结果进行了比较。为了评估结果,使用了精度和召回率测量。我们的结果表明,所提出的框架具有较高的准确性和较低的查全率,这意味着该系统返回的结果实质上比不相关的结果更为重要。结论基于资源的语义注释和NLP技术的使用,已经有足够的生物医学本体论,足够的现有NLP工具和质量的生物医学注释系统来实施生物医学资源发现框架。本研究的结果证明了所提出框架的应用的临床实用性,该框架旨在弥合自然语言中的临床问题与有效的动态生物医学资源发现之间的差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号