...
首页> 外文期刊>JMIR Medical Informatics >A Deep Artificial Neural Network?Based Model for Prediction of Underlying Cause of Death From Death Certificates: Algorithm Development and Validation
【24h】

A Deep Artificial Neural Network?Based Model for Prediction of Underlying Cause of Death From Death Certificates: Algorithm Development and Validation

机译:深度人工神经网络?基于模型的死亡证书死亡潜在原因的模型:算法开发与验证

获取原文

摘要

Background Coding of underlying causes of death from death certificates is a process that is nowadays undertaken mostly by humans with potential assistance from expert systems, such as the Iris software. It is, consequently, an expensive process that can, in addition, suffer from geospatial discrepancies, thus severely impairing the comparability of death statistics at the international level. The recent advances in artificial intelligence, specifically the rise of deep learning methods, has enabled computers to make efficient decisions on a number of complex problems that were typically considered out of reach without human assistance; they require a considerable amount of data to learn from, which is typically their main limiting factor. However, the CépiDc (Centre d’épidémiologie sur les causes médicales de Décès) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of training examples available for the machine learning practitioner. Objective This article investigates the application of deep neural network methods to coding underlying causes of death. Methods The investigated dataset was based on data contained from every French death certificate from 2000 to 2015, containing information such as the subject’s age and gender, as well as the chain of events leading to his or her death, for a total of around 8 million observations. The task of automatically coding the subject’s underlying cause of death was then formulated as a predictive modelling problem. A deep neural network?based model was then designed and fit to the dataset. Its error rate was then assessed on an exterior test dataset and compared to the current state-of-the-art (ie, the Iris software). Statistical significance of the proposed approach’s superiority was assessed via bootstrap. Results The proposed approach resulted in a test accuracy of 97.8% (95% CI 97.7-97.9), which constitutes a significant improvement over the current state-of-the-art and its accuracy of 74.5% (95% CI 74.0-75.0) assessed on the same test example. Such an improvement opens up a whole field of new applications, from nosologist-level batch-automated coding to international and temporal harmonization of cause of death statistics. A typical example of such an application is demonstrated by recoding French overdose-related deaths from 2000 to 2010. Conclusions This article shows that deep artificial neural networks are perfectly suited to the analysis of electronic health records and can learn a complex set of medical rules directly from voluminous datasets, without any explicit prior knowledge. Although not entirely free from mistakes, the derived algorithm constitutes a powerful decision-making tool that is able to handle structured medical data with an unprecedented performance. We strongly believe that the methods developed in this article are highly reusable in a variety of settings related to epidemiology, biostatistics, and the medical sciences in general.
机译:从死亡证书中死亡的潜在原因的背景编码是一种过程,现在是由专家系统(例如Iris软件)的潜在援助所进行的。因此,它是一种昂贵的过程,即可以遭受地理空间差异,从而严重损害国际一级死亡统计的可比性。最近人工智能的进步,特别是深度学习方法的兴起,使计算机能够在许多复杂问题上做出有效的决策,这些问题通常被认为没有人为援助;它们需要相当多的数据来学习,通常是它们的主要限制因素。然而,Cépidc(Centerd'épidémiologiesur lesMaremédicalesdedécès)在法国国家规模上储存了一份详尽的死亡证书数据库,达到了机器学习从业者可用的数百万次培训。目的本文调查了深度神经网络方法在潜在死亡原因中的应用。方法采用调查数据集根据2000年至2015年的每份法国死亡证明中所载的数据,其中包含受试者年龄和性别等信息,以及导致他或她死亡的事件链,总计约为800万观察。然后制定自动编码受试者的死亡原因的任务作为预测建模问题。然后是一个深度神经网络?然后设计基于模型并适合数据集。然后在外部测试数据集上评估其错误率,并与当前最先进的(即IRIS软件)进行比较。通过自举评估所提出的方法优势的统计显着性。结果拟议的方法导致测试准确性为97.8%(95%CI 97.7-97.9),这构成了对目前最先进的最新的显着改善,其准确性为74.5%(95%CI 74.0-75.0)在同一测试示例中评估。这种改进从核检学家级批量编码到死亡统计原因的国际和时间统一的核检学家级批量编码开辟了整个新的应用领域。通过从2000年到2010年核致法国过量相关的死亡证明了这种申请的典型例子。本文的结论表明,深层人工神经网络完全适合对电子健康记录的分析,并可以直接学习一组复杂的医疗规则来自庞大的数据集,没有任何明确的先验知识。虽然没有完全没有错误,但是派生算法构成了一个强大的决策工具,能够处理具有前所未有的性能的结构化医疗数据。我们强烈认为,本文中开发的方法在与流行病学,生物统计学和医学科学相关的各种环境中是高度可重复使用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号