Random projections as regularizers: learning a linear discriminant from fewer observations than dimensions

Durrant Robert J.; Kaban Ata


Abstract

We prove theoretical guarantees for an averaging-ensemble of randomly projected Fisher linear discriminant classifiers, focusing on the case when there are fewer training observations than data dimensions. The specific form and simplicity of this ensemble permits a direct and much more detailed analysis than existing generic tools in previous works. In particular, we are able to derive the exact form of the generalization error of our ensemble, conditional on the training set, and based on this we give theoretical guarantees which directly link the performance of the ensemble to that of the corresponding linear discriminant learned in the full data space. To the best of our knowledge these are the first theoretical results to prove such an explicit link for any classifier and classifier ensemble pair. Furthermore we show that the randomly projected ensemble is equivalent to implementing a sophisticated regularization scheme to the linear discriminant learned in the original data space and this prevents overfitting in conditions of small sample size where pseudo-inverse FLD learned in the data space is provably poor. Our ensemble is learned from a set of randomly projected representations of the original high dimensional data and therefore for this approach data can be collected, stored and processed in such a compressed form. We confirm our theoretical findings with experiments, and demonstrate the utility of our approach on several datasets from the bioinformatics domain and one very high dimensional dataset from the drug discovery domain, both settings in which fewer observations than dimensions are the norm.
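
The averaging ensemble described in the abstract can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes two classes labelled 0 and 1, equal class priors, a pooled maximum-likelihood covariance estimate, and random projection matrices with i.i.d. standard Gaussian entries. The function names `fit_rp_fld_ensemble` and `predict_rp_fld` and the parameters `k`, `n_members` and `seed` are hypothetical. Each ensemble member is a Fisher linear discriminant learned in a random k-dimensional projection of the data, and the members' discriminant directions are mapped back to the data space and averaged.

```python
import numpy as np


def fit_rp_fld_ensemble(X, y, k, n_members=100, seed=0):
    """Average n_members FLD classifiers, each learned in a random k-dim projection.

    X: (N, d) training data, y: (N,) labels in {0, 1}.
    Assumes k is at most the rank of the pooled covariance estimate
    (about N - 2), so that every projected covariance is invertible.
    """
    rng = np.random.default_rng(seed)
    X0, X1 = X[y == 0], X[y == 1]
    mu0, mu1 = X0.mean(axis=0), X1.mean(axis=0)
    Xc = np.vstack([X0 - mu0, X1 - mu1])   # class-centred data, shape (N, d)
    n, d = Xc.shape
    w = np.zeros(d)
    for _ in range(n_members):
        R = rng.standard_normal((k, d))    # random projection (assumed Gaussian)
        Z = Xc @ R.T                       # projected centred data, shape (N, k)
        S = Z.T @ Z / n                    # projected covariance estimate, shape (k, k)
        # Member's FLD direction in the projected space, mapped back by R^T.
        w += R.T @ np.linalg.solve(S, R @ (mu1 - mu0))
    w /= n_members
    b = -w @ (mu0 + mu1) / 2.0             # threshold at the midpoint of the class means
    return w, b


def predict_rp_fld(w, b, X):
    """Return predicted labels in {0, 1} for the rows of X."""
    return (X @ w + b > 0).astype(int)
```

Note that the sketch only ever inverts k-by-k matrices built from projected data, so the d-by-d covariance estimate, which is singular when there are fewer observations than dimensions, is never formed or inverted. A call such as `w, b = fit_rp_fld_ensemble(X, y, k=10)` followed by `predict_rp_fld(w, b, X_new)` would classify new rows.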


Bibliographic details

Source
Machine Learning | 2015, Issue 2 | pp. 257-286 | 30 pages
Authors
Durrant Robert J.; Kaban Ata
Affiliations

Univ Waikato, Dept Stat, Hamilton 3240, New Zealand;

Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

Original format: PDF
Language: English
Keywords
Random projections; Ensemble learning; Linear discriminant analysis; Compressed learning; Learning theory


