Weighted linear programming discriminant analysis for high‐dimensional binary classification

Yufei Wu; Guan Yu

摘要

Linear discriminant analysis (LDA) is widely used for various binary classification problems. In contrast to the LDA that estimates the precision matrix Ω and the mean difference vector δ in the classification rule separately, the linear programming discriminant (LPD) rule estimates the product Ωδ directly through a constrained ?1 minimization. The LPD rule has very good classification performance on many high‐dimensional binary classification problems. However, to estimate β* = Ωδ, the LPD rule uses equal weights for all the elements of β* in the constrained ?1 minimization. It may not deliver the optimal estimate of β*, and therefore the estimated discriminant direction can be suboptimal. In order to obtain better estimates of β* and the discriminant direction, we can heavily penalize βj in the constrained ?1 minimization if we suspect the jth feature is useless for the classification while moderately penalize βj if we suspect the jth feature is useful. In this paper, based on the LPD rule and some popular feature screening methods, we propose a new weighted linear programming discriminant (WLPD) rule for the high‐dimensional binary classification problem. The screening statistics used in the marginal two‐sample t‐test screening, Kolmogorov–Smirnov filter, and the maximum marginal likelihood screening will be used to construct appropriate weights for different elements of β* flexibly. Besides the linear programming algorithm, we develop a new alternating direction method of multipliers algorithm to solve the high‐dimensional constrained ?1 minimization problem efficiently. Our numerical studies show that our proposed WLPD rule can outperform LPD and serve as an effective binary classification tool.

机译：线性判别分析（LDA）广泛用于各种二进制分类问题。与估计分类规则中的精确矩阵ω和平均差向量Δ的LDA相反，线性编程判别（LPD）规则通过约束的Δ1最小化估计产品ωδ。 LPD规则在许多高维二进制分类问题上具有很好的分类性能。然而，为了估计β* =ωδ，LPD规则使用受约束的β*中的所有元素的相等权重。它可能无法提供β*的最佳估计，因此估计的判别方向可以是次优。为了获得更好的β*和判别方向估计，我们可以严重惩罚受约束的？1最小化的βj，如果我们怀疑第j个功能对于分类而无用，而如果我们怀疑第j个功能，则在中度惩罚βj是有用的。本文基于LPD规则和一些流行的特征筛选方法，我们提出了一种新的加权线性规划判别（WLPD）规则为高维二进制分类问题。在边缘两样T检验筛选，Kolmogorov-Smirnov滤波器和最大边缘似然筛选中使用的筛选统计数据将用于灵活地构建适当的β*的不同元件的重量。除了线性编程算法外，我们开发了一种新的乘法机算法的交替方向方法，以有效地解决高维约束的最小化问题。我们的数值研究表明，我们提出的WLPD规则可以优于LPD并用作有效的二进制分类工具。

Weighted linear programming discriminant analysis for high‐dimensional binary classification

摘要

著录项

相关主题

期刊订阅