Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning

机译：使用自然语言处理和机器学习有效识别国家授权的可报告癌症病例

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

>Objective To help cancer registrars efficiently and accurately identify reportable cancer cases.>Material and Methods The Cancer Registry Control Panel (CRCP) was developed to detect mentions of reportable cancer cases using a pipeline built on the Unstructured Information Management Architecture – Asynchronous Scaleout (UIMA-AS) architecture containing the National Library of Medicine’s UIMA MetaMap annotator as well as a variety of rule-based UIMA annotators that primarily act to filter out concepts referring to nonreportable cancers. CRCP inspects pathology reports nightly to identify pathology records containing relevant cancer concepts and combines this with diagnosis codes from the Clinical Electronic Data Warehouse to identify candidate cancer patients using supervised machine learning. Cancer mentions are highlighted in all candidate clinical notes and then sorted in CRCP’s web interface for faster validation by cancer registrars.>Results CRCP achieved an accuracy of 0.872 and detected reportable cancer cases with a precision of 0.843 and a recall of 0.848. CRCP increases throughput by 22.6% over a baseline (manual review) pathology report inspection system while achieving a higher precision and recall. Depending on registrar time constraints, CRCP can increase recall to 0.939 at the expense of precision by incorporating a data source information feature.>Conclusion CRCP demonstrates accurate results when applying natural language processing features to the problem of detecting patients with cases of reportable cancer from clinical notes. We show that implementing only a portion of cancer reporting rules in the form of regular expressions is sufficient to increase the precision, recall, and speed of the detection of reportable cancer cases when combined with off-the-shelf information extraction software and machine learning.

机译：>目的以帮助癌症注册服务商有效，准确地识别可报告的癌症病例。>材料和方法开发了癌症注册控制面板（CRCP），用于通过管道检测提及的可报告癌症病例基于非结构化信息管理架构–异步横向扩展（UIMA-AS）架构，该架构包含美国国家医学图书馆的UIMA MetaMap注释器以及各种基于规则的UIMA注释器，这些注释器主要用于过滤涉及不可报告癌症的概念。 CRCP每晚检查病理报告，以识别包含相关癌症概念的病理记录，并将其与临床电子数据仓库中的诊断代码结合起来，以使用监督机器学习来识别候选癌症患者。在所有候选临床说明中都会突出显示癌症提及，然后在CRCP的Web界面中对癌症进行分类，以便癌症注册服务商更快地进行验证。>结果 CRCP的准确度为0.872，检测到的可报告的癌症病例的准确度为0.843，召回为0.848。与基线（手动检查）病理报告检查系统相比，CRCP的吞吐率提高了22.6％，同时实现了更高的精度和召回率。根据注册服务商的时间限制，CRCP可以通过合并数据源信息功能而以精确度为代价将召回率提高到0.939。>结论 CRCP在将自然语言处理功能应用于检测患有以下疾病的患者时证明了准确的结果临床记录中可报告癌症的病例。我们显示，与现成的信息提取软件和机器学习相结合，仅以正则表达式形式实施一部分癌症报告规则就足以提高检测可报告癌症病例的准确性，召回率和速度。

著录项

期刊名称 Journal of the American Medical Informatics Association : JAMIA
作者
John D Osborne; Matthew Wyatt; Andrew O Westfall; James Willig; Steven Bethard; Geoff Gordon;
展开▼
作者单位

展开▼
年(卷),期 2016(23),6
年度 2016
页码 1077–1084
总页数 8
原文格式 PDF
正文语种
中图分类
关键词
natural language processing machine learning information extraction neoplasms electronic health records user-computer interface;

机译：自然语言处理;机器学习;信息提取;肿瘤;电子病历;用户计算机界面;

相似文献

外文文献
中文文献
专利

1. Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning [J] . Osborne John D., Wyatt Matthew, Westfall Andrew O., Journal of the American Medical Informatics Association : . 2016,第6期

机译：使用自然语言处理和机器学习有效识别国家授权的可报告癌症病例
2. Novel application of natural language processing and machine learning techniques to analyze qualitative patient-reported outcomes data: a report from the PEPR pediatric cancer survivorship study [J] . Lu Zhaohua, Baker Justin, Krull Kevin, Quality of life research: An international journal of quality of life aspects of treatment, care and rehabilitation . 2019,第Suppla1期

机译：新颖的自然语言处理和机器学习技术分析定性患者报告的结果数据：PEPR儿科癌症生存研究的报告
3. Machine Vision Methods, Natural Language Processing, and Machine Learning Algorithms for Automated Dispersion Plot Analysis and Chemical Identification from Complex Mixtures [J] . Yeap Danny, Hichwa Paul T., Rajapakse Maneeshin Y., Analytical chemistry . 2019,第16期

机译：机器视觉方法，自然语言处理和机器学习算法，用于自动分散绘图分析和复杂混合物的化学识别
4. UTA_DLNLP at SemEval-2016 Task 12: Deep Learning Based Natural Language Processing System for Clinical Information Identification from Clinical Notes and Pathology Reports [C] . Peng Li, Heng Huang International workshop on semantic evaluation;Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies . 2016

机译：UTA_DLNLP在SemEval-2016任务12：基于深度学习的自然语言处理系统，用于从临床笔记和病理报告中识别临床信息
5. Leveraging unstructured construction injury reports to predict safety outcomes and model safety risk using Natural Language Processing, Machine Learning, and probability theory [D] . Tixier, Antoine Jean-Pierre. 2015

机译：利用非结构化施工损伤报告以预测使用自然语言处理，机器学习和概率理论来预测安全结果和模型安全风险
6. Evaluation of an international medical E-learning course with natural language processing and machine learning [O] . Aditya Borakati 2021

机译：用自然语言处理和机器学习评估国际医学电子学习课程
7. Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning [O] . John D Osborne, Matthew Wyatt, Andrew O Westfall, 2016

机译：利用自然语言加工和机器学习有效地识别国家授权的可报告癌症病例

Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning

摘要

著录项

相似文献

相关主题

期刊订阅