...
【24h】

Uncertainty Quantification for Modern High-Dimensional Regression via Scalable Bayesian Methods

机译:通过可扩展贝叶斯方法对现代高维回归的不确定性量化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Tremendous progress has been made in the last two decades in the area of high-dimensional regression, especially in the "large p, small n" setting. Such sample starved settings inevitably lead to models which are potentially very unstable and hence quite unreliable. To this end, Bayesian shrinkage methods have generated a lot of recent interest in the modern high-dimensional regression and model selection context. Such methods span the wide spectrum of modern regression approaches and include among others, spike-and-slab priors, the Bayesian lasso, ridge regression, and global-local shrinkage priors such as the Horseshoe prior and the Dirichlet-Laplace prior. These methods naturally facilitate tractable uncertainty quantification and have thus been used extensively across diverse applications. A common unifying feature of these models is that the corresponding priors on the regression coefficients can be expressed as a scale mixture of normals. This property has been leveraged extensively to develop various three-step Gibbs samplers to explore the corresponding intractable posteriors. The convergence of such samplers however is very slow in high dimensions settings, making them disconnected to the very setting that they are intended to work in. To address this challenge, we propose a comprehensive and unifying framework to draw from the same family of posteriors via a class of tractable and scalable two-step blocked Gibbs samplers. We demonstrate that our proposed class of two-step blocked samplers exhibits vastly superior convergence behavior compared to the original three-step sampler in high-dimensional regimes on simulated data as well as data from a variety of applications including gene expression data, infrared spectroscopy data, and socio-economic/law enforcement data. We also provide a detailed theoretical underpinning to the new method by deriving explicit upper bounds for the (geometric) rate of convergence, and by proving that the proposed two-step sampl
机译:在高维回归领域的过去二十年中,尤其是“大P,小N”设置的巨大进展。这种样品饥饿的设置不可避免地导致模型可能非常不稳定,因此非常不可靠。为此,贝叶斯收缩方法对现代高维回归和模型选择上下文产生了很多近期兴趣。这种方法跨越了现代回归方法的广泛频谱,包括钉板,山羊,壁虎,脊回归和全球局部收缩前沿,如马蹄先前和迪里奇特拉普拉斯。这些方法自然促进了易于不确定性量化,因此已广泛使用各种应用。这些模型的共同统一特征是回归系数上的相应前置器可以表示为正常的刻度混合。此属性已广泛利用,以开发各种三步GIBBS采样器,以探索相应的难治性后海底。然而,这种采样器的收敛性在高维设置中非常慢,使它们断开到它们旨在解决的非常挑战。要解决这一挑战,我们提出了一个全面和统一的框架,可以通过同一家族绘制来自同一家族一类贸易和可扩展的两步阻塞GIBBS采样器。我们证明,与在模拟数据上的高维制度中的原始三步采样器中,我们所提出的两步阻塞采样器具有巨大的收敛行为以及来自包括基因表达数据,红外光谱数据的各种应用的高维度制度和社会经济/执法数据。我们还通过导出(几何)收敛率的明确上限,并通过证明所提出的两步SAMPL来提供新方法的详细的理论基础

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号