Pattern Recognition Letters

Selection properties of type II maximum likelihood (empirical Bayes) in linear models with individual variance components for predictors



Abstract

Maximum likelihood (ML) in the linear model overfits when the number of predictors (M) exceeds the number of objects (N). One possible solution is the relevance vector machine (RVM), a form of automatic relevance determination that has gained popularity in the pattern recognition and machine learning community through the well-known textbook of Bishop (2006). RVM assigns individual precisions to the weights of predictors, which are then estimated by maximizing the marginal likelihood (type II ML or empirical Bayes). We investigated the selection properties of RVM both analytically and by experiments in a regression setting. We show analytically that RVM selects predictors when the absolute z-ratio (|least squares estimate| / standard error) exceeds 1 in the case of orthogonal predictors and, for M = 2, that this still holds true for correlated predictors when the other z-ratio is large. RVM selects the stronger of two highly correlated predictors. In experiments with real and simulated data, RVM is outcompeted by other popular regularization methods (LASSO and/or PLS) in terms of prediction performance. We conclude that type II ML is not the general answer in high-dimensional prediction problems. In extensions of RVM aimed at stronger selection, improper priors (based on the inverse gamma family) have been assigned to the inverse precisions (variances), with parameters estimated by penalized marginal likelihood. We critically assess this approach and suggest a proper variance prior related to the Beta distribution, which gives similar selection and shrinkage properties and allows a fully Bayesian treatment.
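The selection rule stated in the abstract can be illustrated numerically. The sketch below (our own construction, not the paper's code; variable names and the simulated weights are assumptions) builds an orthonormal design, computes least-squares estimates and their standard errors, and applies the |z-ratio| > 1 rule that the abstract attributes to RVM in the orthogonal case:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 100, 4

# Orthonormal predictors via QR decomposition, so X'X = I
X, _ = np.linalg.qr(rng.standard_normal((n, m)))

# Two informative predictors, two null predictors (illustrative values)
w_true = np.array([2.0, 1.0, 0.0, 0.0])
y = X @ w_true + rng.standard_normal(n) * 0.3

# Least-squares estimates; with orthonormal columns these are X'y
w_ls = X.T @ y

# Residual variance and standard errors of the estimates
resid = y - X @ w_ls
sigma2 = resid @ resid / (n - m)
se = np.sqrt(sigma2 * np.diag(np.linalg.inv(X.T @ X)))

# Selection rule from the paper: keep predictor i iff |z_i| > 1
z = w_ls / se
selected = np.abs(z) > 1.0
print("z-ratios:", z.round(2))
print("selected:", selected)
```

Because the informative predictors have z-ratios well above 1, they are retained, while a null predictor is retained only when its noise-driven z-ratio happens to exceed 1, which is consistent with the paper's observation that the |z| > 1 threshold gives only weak selection.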


