首页> 美国卫生研究院文献>Journal of the American Medical Informatics Association : JAMIA >Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture component evaluation and applications
【2h】

Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture component evaluation and applications

机译:Mayo临床文本分析和知识提取系统(cTAKES):体系结构组件评估和应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We aim to build and evaluate an open-source natural language processing system for information extraction from electronic medical record clinical free-text. We describe and evaluate our system, the clinical Text Analysis and Knowledge Extraction System (cTAKES), released open-source at . The cTAKES builds on existing open-source technologies—the Unstructured Information Management Architecture framework and OpenNLP natural language processing toolkit. Its components, specifically trained for the clinical domain, create rich linguistic and semantic annotations. Performance of individual components: sentence boundary detector accuracy=0.949; tokenizer accuracy=0.949; part-of-speech tagger accuracy=0.936; shallow parser F-score=0.924; named entity recognizer and system-level evaluation F-score=0.715 for exact and 0.824 for overlapping spans, and accuracy for concept mapping, negation, and status attributes for exact and overlapping spans of 0.957, 0.943, 0.859, and 0.580, 0.939, and 0.839, respectively. Overall performance is discussed against five applications. The cTAKES annotations are the foundation for methods and modules for higher-level semantic processing of clinical free-text.
机译:我们旨在建立和评估一个开放源代码自然语言处理系统,以从电子病历临床自由文本中提取信息。我们描述并评估了我们的系统,即临床文本分析和知识提取系统(cTAKES),该系统在http://www.ibm.com/opensource/上发布。 cTAKES建立在现有的开源技术之上,即非结构化信息管理架构框架和OpenNLP自然语言处理工具包。其组件经过专门针对临床领域的培训,可创建丰富的语言和语义注释。各个组件的性能:句子边界检测器精度= 0.949;分词器精度= 0.949;词性标记器的准确度= 0.936;浅层解析器F分数= 0.924;已命名的实体识别器和系统级评估的F分数= 0.715(准确度)和0.824(重叠范围),概念映射,否定和状态属性的准确度(准确度和重叠范围)分别为0.957、0.943、0.859和0.580、0.939,以及分别为0.839。针对五个应用程序讨论了整体性能。 cTAKES注释是临床自由文本的高级语义处理方法和模块的基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号