首页> 外文期刊>Journal of Biopharmaceutical Statistics >Stability Investigations of Multivariable Regression Models Derived from Low- and High-Dimensional Data
【24h】

Stability Investigations of Multivariable Regression Models Derived from Low- and High-Dimensional Data

机译:从低维和高维数据得出的多元回归模型的稳定性研究

获取原文
获取原文并翻译 | 示例
           

摘要

Multivariable regression models can link a potentially large number of variables to various kinds of outcomes, such as continuous, binary, or time-to-event endpoints. Selection of important variables and selection of the functional form for continuous covariates are key parts of building such models but are notoriously difficult due to several reasons. Caused by multicollinearity between predictors and a limited amount of information in the data, (in)stability can be a serious issue of models selected. For applications with a moderate number of variables, resampling-based techniques have been developed for diagnosing and improving multivariable regression models. Deriving models for high-dimensional molecular data has led to the need for adapting these techniques to settings where the number of variables is much larger than the number of observations. Three studies with a time-to-event outcome, of which one has high-dimensional data, are used to illustrate several techniques. Investigations at the covariate level and at the predictor level are seen to provide considerable insight into model stability and performance. While some areas are indicated where resampling techniques for model building still need further refinement, our case studies illustrate that these techniques can already be recommended for wider use.View full textDownload full textKey WordsComplexity, High-dimensional data, Resampling, Stability, Variable selectionRelated var addthis_config = { ui_cobrand: "Taylor & Francis Online", services_compact: "citeulike,netvibes,twitter,technorati,delicious,linkedin,facebook,stumbleupon,digg,google,more", pubid: "ra-4dff56cd6bb1830b" }; Add to shortlist Link Permalink http://dx.doi.org/10.1080/10543406.2011.629890
机译:多变量回归模型可以将潜在的大量变量链接到各种结果,例如连续,二进制或事件发生时间终点。重要变量的选择和连续协变量的功能形式的选择是构建此类模型的关键部分,但由于多种原因,这非常困难。由于预测变量之间的多重共线性和数据中的信息量有限,(不稳定)稳定性可能是所选模型的一个严重问题。对于变量数量适中的应用,已经开发了基于重采样的技术来诊断和改进多变量回归模型。高维分子数据的推导模型导致需要将这些技术适应于变量数量远大于观测数量的设置。具有事件发生时间的三项研究(其中一项具有高维数据)用于说明几种技术。在协变量级别和预测变量级别上的调查可以为模型稳定性和性能提供相当大的洞察力。虽然指出了一些需要进一步完善用于模型构建的重采样技术的领域,但我们的案例研究表明这些技术已经可以推荐用于更广泛的使用。查看全文下载全文关键词,高维数据,重采样,稳定性,变量选择相关变量addthis_config = {ui_cobrand:“泰勒和弗朗西斯在线”,servicescompact:“ citeulike,netvibes,twitter,technorati,delicious,linkedin,facebook,stumbleupon,digg,google,更多”,发布:“ ra-4dff56cd6bb1830b”};添加到候选列表链接永久链接http://dx.doi.org/10.1080/10543406.2011.629890

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号