Sparse Bayesian variable selection for classifying high-dimensional data

Yang Aijun; Lian Heng; Jiang Xuejun; Liu Pengfei

Abstract

Identifying differentially expressed genes for classifying experiment classes is an important application of microarrays, and methods for selecting important genes are essential to accurate classification. Because the number of genes is large and many of them are irrelevant, insignificant, or redundant, standard statistical methods do not work well, and existing methods need to be modified to analyze microarray data better. We present a stochastic variable selection approach for gene selection that places different two-level hierarchical prior distributions on the regression coefficients. These priors act as a sparsity-enforcing mechanism that performs gene selection for classification. We develop and implement an efficient algorithm that uses MCMC methods to simulate parameters from the posterior distribution. The algorithm is robust to the choice of initial values and produces posterior probabilities of related genes for biological interpretation. To highlight potential applications of the proposed approach, we apply it to the well-known colon cancer and leukemia data sets from the microarray literature.
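
The abstract mentions posterior probabilities of genes obtained from MCMC output. The following is a minimal sketch of how such probabilities are commonly estimated, assuming the sampler records one binary inclusion indicator per gene at each iteration. The function name `inclusion_probabilities`, the `burn_in` parameter, and the toy draws are hypothetical illustrations and are not taken from the paper.

```python
from typing import List, Sequence


def inclusion_probabilities(draws: Sequence[Sequence[int]], burn_in: int = 0) -> List[float]:
    """Estimate P(gamma_j = 1 | data) as the fraction of retained draws with gamma_j = 1."""
    kept = draws[burn_in:]
    if not kept:
        raise ValueError("no draws left after burn-in")
    n_genes = len(kept[0])
    counts = [0] * n_genes
    for gamma in kept:
        if len(gamma) != n_genes:
            raise ValueError("all draws must have the same length")
        for j, g in enumerate(gamma):
            counts[j] += g
    return [c / len(kept) for c in counts]


# Hypothetical indicator draws for 4 genes (1 = gene included in the model).
toy_draws = [
    [0, 0, 0, 0],  # discarded as burn-in
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [1, 0, 1, 0],
]
probs = inclusion_probabilities(toy_draws, burn_in=1)

# Rank genes from most to least frequently included.
ranked = sorted(range(len(probs)), key=lambda j: probs[j], reverse=True)
print(probs)
print(ranked)
```

Genes with a high estimated inclusion probability would be the natural candidates for biological follow-up. How the indicator draws themselves are generated depends on the priors and sampler described in the full paper.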

Bibliographic details

Source: Statistics and Its Interface, 2018, Issue 2, 11 pages
Authors: Yang Aijun; Lian Heng; Jiang Xuejun; Liu Pengfei
Original format: PDF
Chinese Library Classification: Statistics
