首页> 美国卫生研究院文献>AMIA Summits on Translational Science Proceedings >Paraphrasing to improve the performance of Electronic Health Records Question Answering
【2h】

Paraphrasing to improve the performance of Electronic Health Records Question Answering

机译:改写以提高电子病历的性能问答

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper describes a paraphrasing approach to improve the performance of question answering (QA) for electronic health records (EHRs). QA systems for structured EHR data usually rely on semantic parsing, which aims to generate machine-understandable logical forms from free-text questions. Training semantic parsers requires large datasets of question-logical form (QL) pairs, which are labor-intensive to create. Considering the scarcity of large QL datasets in the clinical domain, we propose a framework for expanding an existing dataset using paraphrasing. We experiment with different heuristics for multiple sample sizes and iterations to assess the effect of adding paraphrasing to the task of semantic parsing. We found that adding paraphrases to an existing dataset based on TERTHRESHOLD scores results in an improved performance in the majority (74%) of the experimental runs. Hence, the proposed paraphrasing-based framework has the potential to improve the performance of QA systems using a limited set of existing QL annotations.
机译:本文介绍了一种释义方法,以提高电子健康记录(EHR)的问答功能(QA)的性能。用于结构化EHR数据的QA系统通常依赖于语义解析,该解析旨在从自由文本问题生成机器可理解的逻辑形式。训练语义解析器需要使用问题-逻辑形式(QL)对的大型数据集,创建起来非常费力。考虑到临床领域中大型QL数据集的稀缺性,我们提出了使用释义扩展现有数据集的框架。我们针对多个样本大小和迭代使用不同的启发式方法进行实验,以评估在语义解析任务中添加释义的效果。我们发现,在基于TERTHRESHOLD分数的现有数据集中添加复述可以在大多数(74%)实验运行中提高性能。因此,所提出的基于释义的框架具有使用有限的一组现有QL注释来改善QA系统性能的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号