首页> 外文会议>International Conference on Pattern Recognition in Bioinformatics >Ensemble Logistic Regression for Feature Selection
【24h】

Ensemble Logistic Regression for Feature Selection

机译:功能选择的合奏逻辑回归

获取原文

摘要

This paper describes a novel feature selection algorithm embedded into logistic regression. It specifically addresses high dimensional data with few observations, which are commonly found in the biomedical domain such as microarray data. The overall objective is to optimize the predictive performance of a classifier while favoring also sparse and stable models. Feature relevance is first estimated according to a simple t-test ranking. This initial feature relevance is treated as a feature sampling probability and a multivariate logistic regression is iteratively reestimated on subsets of randomly and non-uniformly sampled features. At each iteration, the feature sampling probability is adapted according to the predictive performance and the weights of the logistic regression. Globally, the proposed selection method can be seen as an ensemble of logistic regression models voting jointly for the final relevance of features. Practical experiments reported on several microarray datasets show that the proposed method offers a comparable or better stability and significantly better predictive performances than logistic regression regularized with Elastic Net. It also outperforms a selection based on Random Forests, another popular embedded feature selection from an ensemble of classifiers.
机译:本文介绍了一种嵌入到逻辑回归的新颖特征选择算法。它具体地解决了几种观察结果的高维数据,其通常在生物医学领域中发现,例如微阵列数据。整体目标是优化分类器的预测性能,同时有利于稀疏和稳定的模型。首先根据简单的T-Test排名估计特征相关性。该初始特征相关性被视为特征采样概率,并且在随机和非均匀采样功能的子集上迭代地重新定位多变量逻辑回归。在每次迭代时,特征采样概率根据预测性能和逻辑回归的权重调整。在全球范围内,所提出的选择方法可以被视为逻辑回归模型的集合,该模型共同表达了特征的最终相关性。在几个微阵列数据集上报告的实际实验表明,该方法提供了比用弹性网规范化的后勤回归的可比性或更好的稳定性和明显更好的预测性能。它还超越了基于随机林的选择,另一个流行的嵌入功能从分类器的集合中选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号