Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: A novel informatics approach

AnanthakrishnanA.N.; CaiT.; SavovaG.; ChengS.-C.; ChenP.; PerezR.G.; GainerV.S.; MurphyS.N.; SzolovitsP.; XiaZ.; ShawS.; ChurchillS.; KarlsonE.W.; KohaneI.; PlengeR.M.; LiaoK.P.

首页> 外文期刊>Inflammatory bowel diseases >Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: A novel informatics approach

【24h】

Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: A novel informatics approach

机译：使用自然语言处理改善电子病历中克罗恩病和溃疡性结肠炎的病例定义：一种新颖的信息学方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background: Previous studies identifying patients with inflammatory bowel disease using administrative codes have yielded inconsistent results. Our objective was to develop a robust electronic medical record-based model for classification of inflammatory bowel disease leveraging the combination of codified data and information from clinical text notes using natural language processing. Methods: Using the electronic medical records of 2 large academic centers, we created data marts for Crohn's disease (CD) and ulcerative colitis (UC) comprising patients with ≥1 International Classification of Diseases, 9th edition, code for each disease. We used codified (i.e., International Classification of Diseases, 9th edition codes, electronic prescriptions) and narrative data from clinical notes to develop our classification model. Model development and validation was performed in a training set of 600 randomly selected patients for each disease with medical record review as the gold standard. Logistic regression with the adaptive LASSO penalty was used to select informative variables. Results: We confirmed 399 CD cases (67%) in the CD training set and 378 UC cases (63%) in the UC training set. For both, a combined model including narrative and codified data had better accuracy (area under the curve for CD 0.95; UC 0.94) than models using only disease International Classification of Diseases, 9th edition codes (area under the curve 0.89 for CD; 0.86 for UC). Addition of natural language processing narrative terms to our final model resulted in classification of 6% to 12% more subjects with the same accuracy. Conclusions: Inclusion of narrative concepts identified using natural language processing improves the accuracy of electronic medical records case definition for CD and UC while simultaneously identifying more subjects compared with models using codified data alone.

机译：背景：先前的研究使用行政法规对炎症性肠病患者进行识别的结果不一致。我们的目标是利用自然语言处理技术，结合临床数据注释中的编码数据和信息，开发出基于健壮的电子病历的炎症性肠疾病分类模型。方法：我们使用2个大型学术中心的电子病历，创建了克罗恩病（CD）和溃疡性结肠炎（UC）的数据集市，其中包括≥1国际疾病分类（第9版）的患者，每种疾病的代码。我们使用经过整理的代码（即《国际疾病分类》，第9版代码，电子处方）和临床笔记中的叙述性数据来开发我们的分类模型。在针对每种疾病的600名随机选择患者的训练集中进行了模型开发和验证，并以病历审查作为金标准。用自适应LASSO罚分进行Logistic回归来选择信息量。结果：我们在CD训练集中确认了399例CD病例（67％），在UC训练集中确认了378 UC病例（63％）。对于这两种方法，包括叙述性数据和编码数据的组合模型的准确性（CD 0.95曲线下的面积; UC 0.94）要比仅使用疾病国际分类法第9版代码（CD曲线下的面积0.89; CD曲线下的面积0.86）更好。 UC）。在我们的最终模型中增加自然语言处理叙事术语，可以使相同精度的主题分类增加6％至12％。结论：包含使用自然语言处理识别的叙述概念可以提高CD和UC电子病历定义的准确性，同时与仅使用编码数据的模型相比，可以识别更多的主题。

著录项

来源
《Inflammatory bowel diseases》 |2013年第7期|共10页
作者
AnanthakrishnanA.N.; CaiT.; SavovaG.; ChengS.-C.; ChenP.; PerezR.G.; GainerV.S.; MurphyS.N.; SzolovitsP.; XiaZ.; ShawS.; ChurchillS.; KarlsonE.W.; KohaneI.; PlengeR.M.; LiaoK.P.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类消化系及腹部疾病;
关键词
Crohn's disease; Disease cohort; Informatics; Natural language processing; Ulcerative colitis;

机译：克罗恩病;疾病队列;信息学;自然语言处理;溃疡性结肠炎;

相似文献

外文文献
中文文献
专利

1. Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: A novel informatics approach [J] . AnanthakrishnanA.N., CaiT., SavovaG., Inflammatory bowel diseases . 2013,第7期

机译：使用自然语言处理改善电子病历中克罗恩病和溃疡性结肠炎的病例定义：一种新颖的信息学方法
2. Natural language processing improves identification of colorectal cancer testing in the electronic medical record [J] . DennyJ.C., ChomaN.N., PetersonJ.F., Medical decision making: An international journal of the Society for Medical Decision Making . 2012,第1期

机译：自然语言处理可改善电子病历中大肠癌检测的识别
3. Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach [J] . Honghan Wu, Karen Hodgson, Sue Dyson, JMIR Medical Informatics . 2019,第4期

机译：在自由文本电子医疗记录中有效地重用自然语言处理模型的表型提及识别：嵌入方法的表型
4. Improving Adherence to Clinical Pathways Through Natural Language Processing on Electronic Medical Records [C] . Noa P. Cruz, Lea Canales, Javier García Mu?oz, MEDINFO . 2019

机译：通过对电子医疗记录的自然语言处理改善对临床途径的粘附
5. Epigenetics of Crohn's Disease, Ulcerative Colitis, and Phenotypically Normal Individuals [D] . Phillips, Delisa L. 2017

机译：克罗恩病，溃疡性结肠炎和表型正常个体的表观遗传
6. Improving Case Definition of Crohn’s Disease and Ulcerative Colitis in Electronic Medical Records Using Natural Language Processing: A Novel Informatics Approach [O] . Ashwin N. Ananthakrishnan, Tianxi Cai, Guergana Savova, -1

机译：使用自然语言处理改善电子病历中克罗恩病和溃疡性结肠炎的病例定义：一种新型的信息学方法
7. Improving Case Definition of Crohnʼs Disease and Ulcerative Colitis in Electronic Medical Records Using Natural Language Processing [O] . Ananthakrishnan Ashwin N., Cai Tianxi, Savova Guergana, 2013

机译：利用自然语言处理改善电子病历中克罗恩病和溃疡性结肠炎的病例定义

Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: A novel informatics approach

摘要

著录项

相似文献

相关主题

期刊订阅