Dirichlet Lasso: A Bayesian approach to variable selection

Das Kiranmoy; Sobel Marc

Abstract

Selecting the most important predictor variables in regression analysis is one of the key problems that statistical research has been concerned with for a long time. In this article, we propose a methodology, Dirichlet Lasso (abbreviated as DLASSO), to address this issue in a Bayesian framework. In many modern regression settings, a large set of predictor variables is grouped, and the coefficients belonging to any one of these groups are either all redundant or all important in predicting the response; in those cases we say that the predictors exhibit a group structure. We show that DLASSO is particularly useful where the group structure is not fully known. We exploit the clustering property of Dirichlet Process priors to infer the possibly missing group information. The Dirichlet Process has the advantage of simultaneously clustering the variable coefficients and selecting the best set of predictor variables. We compare the predictive performance of DLASSO with that of Group Lasso and ordinary Lasso using real data and simulation studies. Our results demonstrate that the predictive performance of DLASSO is almost as good as that of Group Lasso when group label information is given, and superior to that of ordinary Lasso when group information is missing. For high-dimensional data (e.g., genetic data) with missing group information, DLASSO will be a powerful approach to variable selection, since it provides superior predictive performance and higher statistical accuracy.
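The clustering property of Dirichlet Process priors mentioned in the abstract can be illustrated with the Chinese restaurant process, a standard representation of the partition a Dirichlet Process induces. The sketch below is only an illustration of that property, not the authors' DLASSO sampler: the function name `crp_labels`, the concentration value `alpha=1.0`, the number of coefficients, and the normal draw for the shared values are all assumptions made for the example.

```python
import numpy as np

def crp_labels(n_items, alpha, rng):
    """Draw cluster labels for n_items from a Chinese restaurant process.

    Hypothetical helper for illustration; alpha is the concentration parameter.
    """
    labels = np.zeros(n_items, dtype=int)
    counts = [1]  # the first item always opens cluster 0
    for i in range(1, n_items):
        # An existing cluster k is chosen with probability counts[k] / (i + alpha);
        # a new cluster is opened with probability alpha / (i + alpha).
        weights = np.array(counts + [alpha], dtype=float)
        k = rng.choice(len(weights), p=weights / weights.sum())
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        labels[i] = k
    return labels

rng = np.random.default_rng(0)
labels = crp_labels(n_items=20, alpha=1.0, rng=rng)

# Assumption for illustration: every coefficient in a cluster shares one value,
# so the partition plays the role of an inferred group structure.
cluster_values = rng.normal(size=labels.max() + 1)
beta = cluster_values[labels]

print(labels)
print(np.round(beta, 2))
```

Coefficients that share a label share a value, which is how a partition of this kind can stand in for missing group labels. Smaller values of `alpha` tend to produce fewer, larger clusters.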

Bibliographic details

Source
Statistical modeling: applications in contemporary issues | 2015, Issue 3 | 18 pages
Authors
Das Kiranmoy; Sobel Marc
Original format: PDF
Language: English
CLC classification: Statistics
Keywords
Bayesian Lasso; Dirichlet process prior; Group lasso; Gibbs sampling; M-H algorithm
Date added to database: 2022-08-18 15:08:10
