首页> 外文会议>International Conference on Artificial Intelligence in Medicine >TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding
【24h】

TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding

机译:Transicd:基于变压器的守则注意模型,可解释ICD编码

获取原文

摘要

International Classification of Disease (ICD) coding procedure which refers to tagging medical notes with diagnosis codes has been shown to be effective and crucial to the billing system in medical sector. Currently, ICD codes are assigned to a clinical note manually which is likely to cause many errors. Moreover, training skilled coders also requires time and human resources. Therefore, automating the ICD code determination process is an important task. With the advancement of artificial intelligence theory and computational hardware, machine learning approach has emerged as a suitable solution to automate this process. In this project, we apply a transformer-based architecture to capture the interdependence among the tokens of a document and then use a code-wise attention mechanism to learn code-specific representations of the entire document. Finally, they are fed to separate dense layers for corresponding code prediction. Furthermore, to handle the imbalance in the code frequency of clinical datasets, we employ a label distribution aware margin (LDAM) loss function. The experimental results on the MIMIC-Ⅲ dataset show that our proposed model outperforms other baselines by a significant margin. In particular, our best setting achieves a micro-AUC score of 0.923 compared to 0.868 of bidirectional recurrent neural networks. We also show that by using the code-wise attention mechanism, the model can provide more insights about its prediction, and thus it can support clinicians to make reliable decisions.
机译:国际疾病分类(ICD)编码程序是指具有诊断码标记医疗票据的编码程序,已显示对医疗部门的计费系统有效和至关重要。目前,ICD代码手动分配给临床笔记,这可能会导致许多错误。此外,培训技术人员还需要时间和人力资源。因此,自动化ICD代码确定过程是一个重要任务。随着人工智能理论和计算硬件的进步,机器学习方法已成为自动化此过程的合适解决方案。在此项目中,我们应用基于变换器的架构,以捕获文档的令牌之间的相互依赖,然后使用代码明智的注意机制来学习整个文档的特定于代码的表示。最后,它们被馈送到分开的密集层以进行相应的码预测。此外,为了处理临床数据集的代码频率的不平衡,我们采用标签分布意识裕度(LDAM)丢失功能。模拟 - Ⅲ数据集的实验结果表明,我们所提出的模型优于其他基线,通过显着的边缘地形成其他基线。特别是,与双向经常性神经网络的0.868相比,我们的最佳设置达到0.923的微AUC评分。我们还表明,通过使用代码明智的关注机制,该模型可以提供更多关于其预测的见解,因此它可以支持临床医生做出可靠的决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号