首页> 外文期刊>Artificial intelligence in medicine >SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions
【24h】

SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions

机译:SemBioNLQA:一种语义生物医学问题解答系统,用于检索对自然语言问题的准确和理想答案

获取原文
获取原文并翻译 | 示例
       

摘要

Background and objective: Question answering (QA), the identification of short accurate answers to users questions written in natural language expressions, is a longstanding issue widely studied over the last decades in the open-domain. However, it still remains a real challenge in the biomedical domain as the most of the existing systems support a limited amount of question and answer types as well as still require further efforts in order to improve their performance in terms of precision for the supported questions. Here, we present a semantic biomedical QA system named SemBioNLQA which has the ability to handle the kinds of yeso, factoid, list, and summary natural language questions.Methods: This paper describes the system architecture and an evaluation of the developed end-to-end biomedical QA system named SemBioNLQA, which consists of question classification, document retrieval, passage retrieval and answer extraction modules. It takes natural language questions as input, and outputs both short precise answers and summaries as results. The SemBioNLQA system, dealing with four types of questions, is based on (1) handcrafted lexico-syntactic patterns and a machine learning algorithm for question classification, (2) PubMed search engine and UMLS similarity for document retrieval, (3) the BM25 model, stemmed words and UMLS concepts for passage retrieval, and (4) UMLS metathesaurus, BioPortal synonyms, sentiment analysis and term frequency metric for answer extraction.Results and conclusion: Compared with the current state-of-the-art biomedical QA systems, SemBioNLQA, a fully automated system, has the potential to deal with a large amount of question and answer types. SemBioNLQA retrieves quickly users' information needs by returning exact answers (e.g., "yes", "no", a biomedical entity name, etc.) and ideal answers (i.e., paragraph-sized summaries of relevant information) for yeso, factoid and list questions, whereas it provides only the ideal answers for summary questions. Moreover, experimental evaluations performed on biomedical questions and answers provided by the BioASQ challenge especially in 2015, 2016 and 2017 (as part of our participation), show that SemBioNLQA achieves good performances compared with the most current state-of-the-art systems and allows a practical and competitive alternative to help information seekers find exact and ideal answers to their biomedical questions.
机译:背景和目的:问题解答(QA)是对使用自然语言表达的用户问题的简短准确答案的识别,是在开放域中进行了广泛研究的一个长期问题。但是,由于大多数现有系统仅支持有限数量的问题和答案类型,因此在生物医学领域仍然是一个真正的挑战,并且仍需要进一步的努力以提高其在支持问题的准确性方面的性能。在这里,我们介绍了一个语义生物医学QA系统,名为SemBioNLQA,它能够处理是/否,事实,列表和摘要自然语言问题。方法:本文介绍了该系统的体系结构和对最终结果的评估。名为SemBioNLQA的端到端生物医学质量保证系统,它由问题分类,文档检索,段落检索和答案提取模块组成。它以自然语言问题作为输入,并输出简短的精确答案和摘要作为结果。 SemBioNLQA系统处理四种类型的问题,该系统基于(1)手工制作的词汇句法模式和用于问题分类的机器学习算法,(2)PubMed搜索引擎和用于文档检索的UMLS相似性,(3)BM25模型,词干和用于通行检索的UMLS概念,以及(4)UMLS词库,BioPortal同义词,情感分析和术语频率度量以用于答案提取。结果与结论:与当前最先进的生物医学QA系统相比,SemBioNLQA ,一种全自动系统,具有处理大量问题和答案类型的潜力。 SemBioNLQA通过返回正确的答案(例如“是”,“否”,生物医学实体名称等)和理想的答案(即相关信息的段落大小的摘要)来快速检索用户的信息需求,以确认是/否,事实。并列出问题,而它仅提供摘要问题的理想答案。此外,针对BioASQ挑战提供的生物医学问题和答案进行的实验评估,尤其是在2015年,2016年和2017年(作为我们参与的一部分)表明,与最新的最新系统相比,SemBioNLQA的性能良好。提供了一种实用且具有竞争力的替代方案,以帮助信息搜索者找到其生物医学问题的准确理想的答案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号