首页> 美国卫生研究院文献>AMIA Annual Symposium Proceedings >Validation of clinical problems using a UMLS-based semantic parser.
【2h】

Validation of clinical problems using a UMLS-based semantic parser.

机译:使用基于UMLS的语义解析器验证临床问题。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The capture and symbolization of data from the clinical problem list facilitates the creation of high-fidelity patient resumes for use in aggregate analysis and decision support. We report on the development of a UMLS-based semantic parser and present a preliminary evaluation of the parser in the recognition and validation of disease-related clinical problems. We randomly sampled 20% of the 26,858 unique non-dictionary clinical problems entered into OMR (Online Medical Record) between 1989 and August, 1997, and eliminated a series of qualified problem labels, e.g., history-of, to obtain a dataset of 4122 problem labels. Within this dataset, the authors identified 2810 labels (68.2%) as referring to a broad range of disease-related processes. The parser correctly recognized and validated 1398 of the 2810 disease-related labels (49.8 +/- 1.9%) and correctly excluded 1220 of 1312 non-disease-related labels (93.0 +/- 1.4%). 812 of the 1181 match failures (68.8%) were caused by terms either absent from UMLS or modifiers not accepted by the parser; 369 match failures (31.2%) were caused by labels having patterns not recognized by the parser. By enriching the UMLS lexicon with terms commonly found in provider-entered labels, it appears that performance of the parser can be significantly enhanced over a few subsequent iterations. This initial evaluation provides a foundation from which to make principled additions to the UMLS lexicon locally for use in symbolizing clinical data; further research is necessary to determine applicability to other health care settings.
机译:从临床问题列表中捕获和符号化数据有助于创建高保真度的患者简历,以用于汇总分析和决策支持。我们报告了基于UMLS的语义解析器的发展,并提出了对解析器在识别和验证与疾病相关的临床问题中的初步评估。我们从1989年至1997年8月之间进入OMR(在线医疗记录)的26,858种独特的非字典式临床问题中,随机抽取了20%,并消除了一系列合格的问题标签,例如病史,以获得4122个数据集问题标签。在该数据集中,作者确定了2810个标签(占68.2%),涉及广泛的疾病相关过程。解析器正确识别并验证了2810个疾病相关标签中的1398个(49.8 +/- 1.9%),并正确排除了1312个非疾病相关标签中的1220个(93.0 +/- 1.4%)。 1181个匹配失败中的812个(68.8%)是由于UMLS缺少或解析器不接受的修饰符引起的; 369个匹配失败(占31.2%)是由具有解析器无法识别的模式的标签引起的。通过使用提供者输入的标签中常见的术语来丰富UMLS词典,似乎可以在随后的几次迭代中显着提高解析器的性能。最初的评估提供了基础,可以在本地对UMLS词典进行有原则的添加以用于符号化临床数据;为了确定是否适用于其他医疗机构,有必要进行进一步的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号