Facilitating high-dimensional transparent classification via empirical Bayes variable selection

Bar Haim; Booth James; Wells Martin T.; Liu Kangyan

首页> 外文期刊>Applied stochastic models in business and industry >Facilitating high-dimensional transparent classification via empirical Bayes variable selection

【24h】

Facilitating high-dimensional transparent classification via empirical Bayes variable selection

机译：通过经验贝叶斯变量选择促进高维透明分类

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a two-step approach to classification problems in the large P, small N setting, where the number of predictors may be larger than the sample size. We assume that the association between the predictors and the class variable has an approximate linear-logistic form, but we allow the class boundaries to be nonlinear. We further assume that the number of true predictors is relatively small. In the first step, we use a binomial generalized linear model to identify which predictors are associated with each class and then restrict the data set to these predictors and run a nonlinear classifier, such as a random forest or a support vector machine. We show that, without the variable screening step, the classification performance of both the random forest and support vector machine is degraded when many among the P predictors are not related to the class.

机译：我们在大的P，小n设置中提出了一种两步的分类问题，其中预测器的数量可以大于样本大小。我们假设预测器和类变量之间的关联具有近似的线性逻辑形式，但我们允许类边界是非线性的。我们进一步假设真正的预测器数量相对较小。在第一步中，我们使用二项式广义线性模型来识别哪些预测器与每个类相关联，然后将数据限制为这些预测器并运行非线性分类器，例如随机林或支持向量机。我们表明，如果没有变量筛选步骤，当P预测器中的许多与类相关时，随机林和支持向量机的分类性能会降低。

著录项

来源
《Applied stochastic models in business and industry》 |2018年第6期|共13页
作者
Bar Haim; Booth James; Wells Martin T.; Liu Kangyan;
展开▼
作者单位

Univ Connecticut Dept Stat Storrs CT 06269 USA;

Cornell Univ Dept Biol Stat &

Computat Biol Ithaca NY USA;

Cornell Univ Dept Biol Stat &

Computat Biol Ithaca NY USA;

Univ Connecticut Dept Stat Storrs CT 06269 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用数学;
关键词
EM algorithm; generalized linear models; random forest; support vector machines; variable selection;

机译：EM算法;广义线性模型;随机森林;支持向量机;变量选择;

相似文献

外文文献
中文文献
专利

1. Facilitating high-dimensional transparent classification via empirical Bayes variable selection [J] . Bar Haim, Booth James, Wells Martin T., Applied stochastic models in business and industry . 2018,第6期

机译：通过经验贝叶斯变量选择促进高维透明分类
2. Empirical Bayes vs. fully Bayes variable selection [J] . Cui W, George EI Journal of Statistical Planning and Inference . 2008,第4期

机译：经验贝叶斯与完全贝叶斯变量选择
3. High-dimensional classification via nonparametric empirical Bayes and maximum likelihood inference [J] . Dicker Lee H., Zhao Sihai D. Biometrika . 2016,第1期

机译：通过非参数经验贝叶斯和最大似然推断进行高维分类
4. Automated Feature Weighting in Naive Bayes for High-dimensional Data Classification [C] . Lifei Chen, Shengrui Wang ACM international conference on information and knowledge management . 2012

机译：朴素贝叶斯中的自动特征权重用于高维数据分类
5. Empirical bayes variable selection in high-dimensional regression. [D] . Pungpapong, Vitara. 2012

机译：高维回归中的经验贝叶斯变量选择。
6. Integrating biological knowledge into variable selection: an empirical Bayes approach with an application in cancer biology [O] . Steven M Hill, Richard M Neve, Nora Bayani, 2012

机译：将生物学知识整合到变量选择中：经验贝叶斯方法及其在癌症生物学中的应用
7. Facilitating high‐dimensional transparent classification via empirical Bayes variable selection [O] . Haim Bar, James Booth, Martin T. Wells, 2018

机译：通过经验贝叶斯变量选择促进高维透明分类
8. Simultaneous Inference, and Ranking Selection Procedure: Bayes and Empirical Bayes Approach [R] . Gupta, S. S. 2001

机译：同时推理和排序选择程序：贝叶斯和经验贝叶斯方法

Facilitating high-dimensional transparent classification via empirical Bayes variable selection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅