Maximal Deviations of Incomplete U-statistics with Applications to Empirical Risk Sampling

机译：不完整U形统计数据的最大偏差与申请到经验风险抽样

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is the goal of this paper to extend the Empirical Risk Minimization (ERM) paradigm, from a practical perspective, to the situation where a natural estimate of the risk is of the form of a K-sample U-statistics, as it is the case in the K-partite ranking problem for instance. Indeed, the numerical computation of the empirical risk is hardly feasible if not infeasible, even for moderate samples sizes. Precisely, it involves averaging O(n~(d1+...+dK)) terms, when considering a U-statistic of degrees (d_1,..., dK) based on samples of sizes proportional to n. We propose here to consider a drastically simpler Monte-Carlo version of the empirical risk based on O(n) terms solely, which can be viewed as an in- complete generalized U-statistic, and prove that, remarkably, the approximation stage does not damage the ERM procedure and yields a learning rate of order O_P(1/{the square root of}n). Beyond a theoretical analysis guaranteeing the validity of this approach, numerical experiments are displayed for illustrative purpose.

机译：本文的目标是从实际角度扩大经验风险最小化（ERM）范式，以对风险的自然估计是K样本U统计的形式，因此是例如K-Partite排名问题。实际上，如果不可行，即使对于适度的样本尺寸，实际风险的数值计算几乎不可行。准确地说，它涉及平均o（n〜（d1 + ... + dk）术语，当考虑基于与n的尺寸样本的测量值（d_1，...，dk）。我们在此提出了基于O（n）术语的经验风险的巨大简化的Monte-Carlo版本，其可以被视为完全的普遍性U形统计，并证明近似阶段没有损坏ERM程序并产生O_P的学习率O_P（1 / {n的平方根）。除了保证这种方法的有效性的理论分析之外，展示了数字实验以出于说明性目的。

著录项

来源
《SIAM International Conference on Data Mining》|2013年|804 p.|共9页
会议地点
作者
Stephan Clemencon; Sylvain Robbiano; Jessica Tressou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274.2-53;
关键词
Empirical risk minimization; Risk sampling; Incomplete U-statistics; Ranking; Minimum-volume set;

机译：经验风险最小化;风险采样;不完全U形统计;排名;最小卷集;

相似文献

外文文献
中文文献
专利

1. CRAMER-TYPE MODERATE DEVIATIONS FOR STUDENTIZED TWO-SAMPLE U-STATISTICS WITH APPLICATIONS [J] . Chang Jinyuan, Shao Qi-Man, Zhou Wen-Xin The Annals of Statistics: An Official Journal of the Institute of Mathematical Statistics . 2016,第5期

机译：样本化的两个样本U统计量的Cramer型中度偏差及其应用
2. Two-sample density-based empirical likelihood tests for incomplete data in application to a pneumonia study [J] . Albert Vexler, Jihnhee Yu Biometrical Journal . 2011,第4期

机译：基于两样本密度的经验似然检验，用于肺炎研究中的不完整数据
3. Two-sample density-based empirical likelihood tests for incomplete data in application to a pneumonia study. [J] . Vexler A, Yu J Biometrical Journal . 2011,第4期

机译：基于两样本密度的经验似然测试，用于肺炎研究中的不完整数据。
4. Maximal Deviations of Incomplete U-statistics with Applications to Empirical Risk Sampling [C] . Stephan Clemencon, Sylvain Robbiano, Jessica Tressou SIAM International Conference on Data Mining . 2013

机译：不完整U形统计数据的最大偏差与申请到经验风险抽样
5. PROBABILITY SAMPLE U-STATISTICS: THEORY AND APPLICATIONS FOR COMPLEX SAMPLE DESIGNS (VARIANCE COMPONENTS, ROBUST, INFERENCE). [D] . FOLSOM, RALPH E. 1984

机译：概率样本U统计：复杂样本设计（方差分量，鲁棒性，推论）的理论和应用。
6. Generating age-specific mortality statistics from incomplete death registration data: two applications of the empirical completeness method [O] . Tim Adair, Alan D Lopez 2021

机译：从不完全死亡登记数据产生年龄特异性死亡率统计数据：两种应用的实证完整性方法
7. Maximal deviations of incomplete U-statistics with applications to empirical risk sampling [O] . Clemencon, Stephan, Robbiano, Sylvain, Tressou-Cosmao, Jessica 2013

机译：不完全U统计量的最大偏差及其在经验风险抽样中的应用

Maximal Deviations of Incomplete U-statistics with Applications to Empirical Risk Sampling

摘要

著录项

相似文献

相关主题

期刊订阅