首页> 外文期刊>Biometrical Journal >Modeling tails for collinear data with outliers in the English Longitudinal Study of Ageing: Quantile profile regression
【24h】

Modeling tails for collinear data with outliers in the English Longitudinal Study of Ageing: Quantile profile regression

机译:衰老中英语纵向研究中的异常值的共线数据建模尾部:分位数轮廓回归

获取原文
获取原文并翻译 | 示例
       

摘要

Research has shown that high blood glucose levels are important predictors of incident diabetes. However, they are also strongly associated with other cardiometabolic risk factors such as high blood pressure, adiposity, and cholesterol, which are also highly correlated with one another. The aim of this analysis was to ascertain how these highly correlated cardiometabolic risk factors might be associated with high levels of blood glucose in older adults aged 50 or older from wave 2 of the English Longitudinal Study of Ageing (ELSA). Due to the high collinearity of predictor variables and our interest in extreme values of blood glucose we proposed a new method, called quantile profile regression, to answer this question. Profile regression, a Bayesian nonparametric model for clustering responses and covariates simultaneously, is a powerful tool to model the relationship between a response variable and covariates, but the standard approach of using a mixture of Gaussian distributions for the response model will not identify the underlying clusters correctly, particularly with outliers in the data or heavy tail distribution of the response. Therefore, we propose quantile profile regression to model the response variable with an asymmetric Laplace distribution, allowing us to model more accurately clusters that are asymmetric and predict more accurately for extreme values of the response variable and/or outliers. Our new method performs more accurately in simulations when compared to Normal profile regression approach as well as robustly when outliers are present in the data. We conclude with an analysis of the ELSA.
机译:研究表明,高血糖水平是入射糖尿病的重要预测因子。然而,它们也与其他心脏造成的危险因素(例如高血压,肥胖和胆固醇)密切相关,这也彼此高度相关。该分析的目的是确定这些高度相关的心肌仪危险因素如何与来自老龄化(ELSA)的英语纵向研究的波浪2的50岁或以上的老年人的高水平血糖相关。由于预测因子变量的高相伴性,并且我们对血糖极端值的兴趣我们提出了一种叫做Simentile简介回归的新方法,以回答这个问题。个人资料回归,同时聚类响应和协变量的贝叶斯非参数模型,是一种强大的工具,可以模拟响应变量和协变量之间的关系,但使用响应模型的高斯分布混合的标准方法不会识别底层集群正确,特别是在数据或重型尾部分布的异常值的反应。因此,我们提出了使用非对称LAPLACE分布模拟响应变量的分位数轮廓回归,使我们能够更准确地模拟不对称的群集,并且可以更准确地预测响应变量和/或异常值的极端值。与正常配置文件回归方法相比,我们的新方法在模拟中更准确地在数据中相比,当异常值存在于数据中时稳健地。我们通过分析ELSA的结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号