...
首页> 外文期刊>Journal of Multivariate Analysis: An International Journal >Graph-based sparse linear discriminant analysis for high-dimensional classification
【24h】

Graph-based sparse linear discriminant analysis for high-dimensional classification

机译:基于图的高维分类的稀疏线性判别分析

获取原文
获取原文并翻译 | 示例
           

摘要

Linear discriminant analysis (LDA) is a well-known classification technique that enjoyed great success in practical applications. Despite its effectiveness for traditional low-dimensional problems, extensions of LDA are necessary in order to classify high dimensional data. Many variants of LDA have been proposed in the literature. However, most of these methods do not fully incorporate the structure information among predictors when such information is available. In this paper, we introduce a new high-dimensional LDA technique, namely graph-based sparse LDA (GSLDA), that utilizes the graph structure among the features. In particular, we use the regularized regression formulation for penalized LDA techniques, and propose to impose a structure-based sparse penalty on the discriminant vector beta. The graph structure can be either given or estimated from the training data. Moreover, we explore the relationship between the within-class feature structure and the overall feature structure. Based on this relationship, we further propose a variant of our proposed GSLDA to utilize effectively unlabeled data, which can be abundant in the semi-supervised learning setting. With the new regularization, we can obtain a sparse estimate of beta and more accurate and interpretable classifiers than many existing methods. Both the selection consistency of beta estimation and the convergence rate of the classifier are established, and the resulting classifier has an asymptotic Bayes error rate. Finally, we demonstrate the competitive performance of the proposed GSLDA on both simulated and real data studies. (C) 2018 Elsevier Inc. All rights reserved.
机译:线性判别分析(LDA)是一种着名的分类技术,在实际应用中取得了巨大成功。尽管具有传统的低维问题的有效性,但LDA的扩展是必要的,以便对高维数据进行分类。在文献中提出了LDA的许多变体。然而,当这些信息可用时,这些方法中的大多数不完全包含预测器之间的结构信息。在本文中,我们介绍了一种新的高维LDA技术,即基于图形的稀疏LDA(GSLDA),它利用了特征之间的图形结构。特别是,我们使用正则化回归制剂进行惩罚的LDA技术,并提出在判别载体β上施加基于结构的稀疏罚分。可以从训练数据给出或估计图形结构。此外,我们探讨了课堂内容结构与整体特征结构之间的关系。基于这种关系,我们进一步提出了我们提出的GSLDA的变种,以利用有效的未标记数据,这可能在半监督学习环境中丰富。通过新的正则化,我们可以获得比许多现有方法的Beta和更准确和可解释的分类器的稀疏估计。建立了Beta估计的选择一致性和分类器的收敛速率,并且所得到的分类器具有渐近贝叶斯误差率。最后,我们展示了拟议的GSLDA对模拟和实际数据研究的竞争性能。 (c)2018年Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号