首页> 外文会议>2011 International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering >Survival analysis using neural network hazard model with incomplete covariate data
【24h】

Survival analysis using neural network hazard model with incomplete covariate data

机译:使用不完整协变量数据的神经网络危害模型进行生存分析

获取原文

摘要

This paper presents a new procedure to perform survival analysis when some covariate data are not available. A neural network hazard model is utilized here to model the relationship between covariates and the hazard. In order to consider incomplete covariates, the hidden layer target data are represented to be binary random variables. This will enable the training of the two-layer neural network hazard model to be decomposed into training of two single-layer structures. The training of input-hidden structure now becomes the logistic estimation problem with part of the input and all the output (the hidden layer target) missing. However, there are two major problems for this logistic estimation. It requires assumption about the distribution of the partially observed covariates. In addition, estimation for the logistic function will become complicated when the input data has missing values. Therefore, Instead of logistic function, the general location model is adopted to represent the mixed data set which involves missing values. The training of input-hidden structure thus becomes maximisation of the likelihood of mixed continuous data (covariates) and categorical data (hidden layer targets) within the general location model. The hidden layer targets link the two single structures and are updated iteratively. After each update, the expected values of the hidden layer targets are then used for the training of hidden-output structure of the neural network hazard model. This structure is now same as a generalised linear model (GLM) and is trained by the iteratively reweighted least squares (IRLS) approach. The training for both input-hidden and hidden-output structures will iterate until the estimation is converged. This new approach is applied to a group of bearing data. Parts of the data are deleted deliberately to create different realisations of incomplete covariate set. The numerical study demonstrates that this new approach is capable of handling the incomplete cova--riate data in the survival analysis and its results outperform those of conventional incomplete covariates handling approaches.
机译:当一些协变量数据不可用时,本文提出了一种新的程序来进行生存分析。这里利用神经网络危害模型对协变量和危害之间的关系进行建模。为了考虑不完整的协变量,将隐藏层目标数据表示为二进制随机变量。这将使对两层神经网络危害模型的训练可以分解为对两个单层结构的训练。输入隐藏结构的训练现在成为逻辑估计问题,部分输入和所有输出(隐藏层目标)都丢失了。但是,这种逻辑估计存在两个主要问题。它要求对部分观察到的协变量的分布进行假设。另外,当输入数据缺少值时,逻辑函数的估计将变得复杂。因此,代替逻辑函数,采用通用位置模型来表示涉及缺失值的混合数据集。输入隐藏结构的训练因此成为一般位置模型内混合连续数据(协变量)和分类数据(隐藏层目标)的可能性的最大化。隐藏层目标链接两个单独的结构并进行迭代更新。每次更新后,隐藏层目标的期望值随后将用于训练神经网络危害模型的隐藏输出结构。现在,该结构与广义线性模型(GLM)相同,并且通过迭代加权最小二乘(IRLS)方法进行训练。输入隐藏结构和隐藏输出结构的训练都将重复进行,直到收敛估计为止。这种新方法适用于一组方位数据。故意删除部分数据,以创建不完整协变量集的不同实现。数值研究表明,这种新方法能够处理不完全的气孔。 -- 生存分析中的原始数据及其结果优于传统的不完整协变量处理方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号