Journal of Machine Learning Research

No Unbiased Estimator of the Variance of K-Fold Cross-Validation


Abstract

Most machine learning researchers perform quantitative experiments to estimate generalization error and to compare the performance of different algorithms (in particular, their proposed algorithm). In order to draw statistically convincing conclusions, it is important to estimate the uncertainty of such estimates. This paper studies the very commonly used K-fold cross-validation estimator of generalization performance. The main theorem shows that there exists no universal (valid under all distributions) unbiased estimator of the variance of K-fold cross-validation. The analysis that accompanies this result is based on the eigendecomposition of the covariance matrix of errors, which has only three distinct eigenvalues, corresponding to three degrees of freedom of the matrix and three components of the total variance. This analysis helps to better understand the nature of the problem and how naive estimators (which do not take into account the error correlations due to the overlap between training sets) can grossly underestimate the variance. This is confirmed by numerical experiments in which the three components of the variance are compared as the difficulty of the learning problem and the number of folds are varied.
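To illustrate the estimator the abstract refers to, here is a minimal sketch of K-fold cross-validation together with the naive variance estimator that treats the K fold means as independent. The helper names (`kfold_cv_errors`, `naive_variance`) and the trivial mean predictor are illustrative assumptions, not the paper's experimental setup; the point is only that dividing the sample variance of fold means by K ignores the between-fold correlations that the paper shows cannot be corrected for without bias.

```python
import random
import statistics

def kfold_cv_errors(ys, k):
    """Run K-fold cross-validation with a trivial predictor.

    For each fold, "train" by taking the mean of y on the other
    folds, then record the squared error of that prediction on
    each held-out example.  Returns a list of k per-fold error lists.
    """
    n = len(ys)
    idx = list(range(n))
    random.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    fold_errors = []
    for test in folds:
        held_out = set(test)
        train = [i for i in idx if i not in held_out]
        y_hat = statistics.fmean(ys[i] for i in train)  # "fitted" model
        fold_errors.append([(ys[i] - y_hat) ** 2 for i in test])
    return fold_errors

def naive_variance(fold_errors):
    """Naive variance estimate of the CV mean: s^2 / k over fold means.

    This treats the k fold means as i.i.d.  It ignores the correlations
    induced by overlapping training sets, which is why such estimators
    can grossly underestimate the true variance.
    """
    fold_means = [statistics.fmean(errs) for errs in fold_errors]
    return statistics.variance(fold_means) / len(fold_means)
```

A quick usage example: `naive_variance(kfold_cv_errors(ys, 5))` yields a nonnegative number, but by the paper's main theorem no such statistic can be an unbiased estimator of the CV variance under all data distributions.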
