首页> 外文期刊>Computational statistics & data analysis >Classification by ensembles from random partitions of high-dimensional data
【24h】

Classification by ensembles from random partitions of high-dimensional data

机译:通过对高维数据的随机分区进行集成进行分类

获取原文
获取原文并翻译 | 示例
           

摘要

A robust classification procedure is developed based on ensembles of classifiers, with each classifier constructed from a different set of predictors determined by a random partition of the entire set of predictors. The proposed methods combine the results of multiple classifiers to achieve a substantially improved prediction compared to the optimal single classifier. This approach is designed specifically for high-dimensional data sets for which a classifier is sought. By combining classifiers built from each subspace of the predictors, the proposed methods achieve a computational advantage in tackling the growing problem of dimensionality. For each subspace of the predictors, we build a classification tree or logistic regression tree. Our study shows, using four real data sets from different areas, that our methods perform consistently well compared to widely used classification methods. For unbalanced data, our approach maintains the balance between sensitivity and specificity more adequately than many other classification methods considered in this study.
机译:基于分类器的集合来开发鲁棒的分类程序,其中每个分类器由由整个预测器集合的随机划分确定的不同预测器集合构成。所提出的方法结合了多个分类器的结果,与最佳的单个分类器相比,获得了实质上改善的预测。该方法是专门为寻求分类器的高维数据集设计的。通过组合从预测变量的每个子空间构建的分类器,所提出的方法在解决不断增长的维数问题上获得了计算优势。对于预测变量的每个子空间,我们构建一个分类树或逻辑回归树。我们的研究表明,使用来自不同领域的四个真实数据集,与广泛使用的分类方法相比,我们的方法性能始终很好。对于不平衡的数据,我们的方法比本研究中考虑的许多其他分类方法更充分地保持了敏感性和特异性之间的平衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号