首页> 外文会议>Australian National Health Informatics Conference >Model Selection and Variable Aggregation of Australian Hospital Data
【24h】

Model Selection and Variable Aggregation of Australian Hospital Data

机译:澳大利亚医院数据的模型选择与变聚

获取原文

摘要

Background Hospital administrative data commonly consist of hundreds of variables with many consisting of hundreds, if not thousands, of distinct categories, especially for disease groups. Conventional approaches to develop regression models for prediction either fail completely due to multicollinearity or sparsity issues or take too long and consume too many computer resources. Methods We demonstrate how regularisation and variable aggregation techniques such as Elastic Net can overcome some of these problems. Parameter estimates from univariate generalised linear models (GLM) and Elastic Net models were used to aggregate disease groups into a more manageable number and predict the probability of mortality for a given patient. Results When employed for variable aggregation and variable selection, Elastic Net models ran at least four times faster than GLMs, though producing a less discriminative model. When applied to final models for predicting hospital mortality, though, both Elastic Net and GLM models demonstrated similar predictive power and efficiently solved an otherwise complex problem. Conclusion Elastic Net regularisation and variable aggregation provide an efficient mechanism for solving healthcare modelling problems.
机译:背景技术医院管理数据通常由数百个变量组成,其中许多包括数百个,如果不是数千个,特别是疾病群体。常规方法为了提高预测的回归模型或者由于多含量或稀疏问题或花费太长并且消耗太多计算机资源而完全失败。方法我们展示了如何正则化和可变聚合技术,如弹性网可以克服其中一些问题。单变量推广线性模型(GLM)和弹性网模型的参数估计用于将疾病组聚集成更可管理的数量,并预测给定患者的死亡率。结果当用于可变聚合和可变选择时,弹性网模型的速度比GLM更快地运行,尽管产生了较少的鉴别模型。然而,当应用于预测医院死亡率的最终模型时,弹性网和GLM模型都表现出类似的预测力,有效地解决了一个复杂的问题。结论弹性净正规化和可变聚合提供了解决医疗保健建模问题的有效机制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号