首页> 外文期刊>Statistics and Its Interface >Sparse Bayesian variable selection for classifying high-dimensional data
【24h】

Sparse Bayesian variable selection for classifying high-dimensional data

机译:用于高维数据分类的稀疏贝叶斯变量选择

获取原文
获取外文期刊封面目录资料

摘要

Identifying differentially expressed genes for classifying experiment classes is an important application of microarrays. Methods for selecting important genes are of much significance in accurate classification. Owing to the large number of genes and many of them are irrelevant, insignificant or redundant, standard statistical methods do not work well. The modification of existing methods is needed to achieve better analysis of microarray data. We present a stochastic variable selection approach for gene selection with different two level hierarchical prior distributions for regression coefficients. These priors can be used as a sparsity-enforcing mechanism to perform gene selection for classification. Using simulation-based MCMC methods for simulating parameters from the posterior distribution, an efficient algorithm is developed and implemented. This algorithm is robust to the choices of initial values, and produces posterior probabilities of related genes for biological interpretation. To highlight the potential applications of the proposed approach, we provide examples of the well-known colon cancer data and leukemia data in microarray literature.
机译:鉴定用于分类实验类别的差异表达基因是微阵列的重要应用。选择重要基因的方法在准确分类中具有重要意义。由于基因数量众多,并且其中许多是无关,无关紧要或多余的,因此标准的统计方法效果不佳。需要对现有方法进行修改以更好地分析微阵列数据。我们提出了一种随机变量选择方法,用于选择具有不同两级先验分布的回归系数的基因。这些先验可以用作稀疏性增强机制来执行基因选择以进行分类。使用基于仿真的MCMC方法从后验分布中模拟参数,开发并实现了一种有效的算法。该算法对初始值的选择具有鲁棒性,并产生相关基因的后验概率以用于生物学解释。为了突出提出的方法的潜在应用,我们提供了微阵列文献中著名的结肠癌数据和白血病数据的示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号