首页> 外文会议>Widening Natural Language Processing Workshop >Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning
【24h】

Mapping of Narrative Text Fields To ICD-10 Codes Using Natural Language Processing and Machine Learning

机译:使用自然语言处理和机器学习将叙述文本字段映射到ICD-10代码

获取原文

摘要

The assignment of ICD-10 codes is done manually, which is laborious and prone to errors. The use of natural language processing and machine learning approaches have been receiving increasing attention on automating the task of assigning ICD-10 codes. In this study we investigate the effect of different approaches on automating the task of assigning ICD-10 codes. To do this we use South African clinical dataset containing three narrative text fields (Clinical Summary, Presenting Complaints and Examination Findings). The following traditional machine learning algorithms, namely: Logistic Regression, Multinomial Naive Bayes, Support Vector Machine, Decision Tree, Random Forest and Extreme Gradient Boost were used as our classifiers. Our study results show the strong potential of automated ICD-10 coding from the narrative text fields. Extreme Gradient Boost outperformed other classifiers in automating the task of assigning ICD-10 codes based on the three narrative text fields with an accuracy of 79%, precision of 75%, and recall of 78%. While our worst classifier (Decision Tree) achieved the accuracy of 54%, precision of 60% and recall of 56%.
机译:ICD-10代码的分配是手动完成的,这是费力和易于错误的。使用自然语言处理和机器学习方法已经接受了对分配ICD-10代码的自动化的越来越关注。在这项研究中,我们调查不同方法对分配ICD-10代码的自动化任务的影响。为此,我们使用包含三个叙述文本领域的南非临床数据集(临床摘要,提出投诉和考试结果)。以下传统机器学习算法,即:逻辑回归,多项式天真贝叶斯,支持向量机,决策树,随机森林和极端梯度提升作为我们的分类器。我们的研究结果表明,从叙事文本领域的自动化ICD-10编码的强劲潜力。极端梯度提高了其他分类器,可以自动化基于三个叙述文本领域分配ICD-10代码的任务,精度为79%,精度为75%,并记得78%。虽然我们最糟糕的分类器(决策树)实现了54%,精度为60%并召回56%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号