...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Extrapolating the profile of a finite population
【24h】

Extrapolating the profile of a finite population

机译:推断有限群体的轮廓

获取原文
           

摘要

We study a prototypical problem in empirical Bayes. Namely, consider a population consisting of $k$ individuals each belonging to one of $k$ types (some types can be empty). Without any structural restrictions, it is impossible to learn the composition of the full population having observed only a small (random) subsample of size $m = o(k)$. Nevertheless, we show that in the sublinear regime of $m =omega(k/log k)$, it is possible to consistently estimate in total variation the emph{profile} of the population, defined as the empirical distribution of the sizes of each type, which determines many symmetric properties of the population. We also prove that in the linear regime of $m=c k$ for any constant $c$ the optimal rate is $Theta(1/log k)$. Our estimator is based on Wolfowitz’s minimum distance method, which entails solving a linear program (LP) of size $k$. We show that there is a single infinite-dimensional LP whose value simultaneously characterizes the risk of the minimum distance estimator and certifies its minimax optimality. The sharp convergence rate is obtained by evaluating this LP using complex-analytic techniques.
机译:我们研究了经验贝叶斯的原型问题。即,考虑一个由$ k $个人组成的人口,每个人属于$ k $类型(某些类型可以为空)。没有任何结构限制,不可能学习只观察到尺寸为$ m = o(k)$的小(随机)的全部人群的组成。尽管如此,我们表明,在$ M = ω(k / log k)$中,可以始终如一地估计人口的 emph {profile},定义为实证分布每种类型的大小,这决定了人口的许多对称属性。我们还证明,在$ m = c k $的线性制度中,对于任何常数$ c $ c $ theta(1 / log k)$。我们的估算器基于Wolfowitz的最小距离方法,这需要解决大小$ k $的线性程序(LP)。我们表明,有一个单一无限尺寸的LP,其值同时表征最小距离估计器的风险,并证明其最低限度的最优性。通过使用复杂分析技术评价该LP来获得急剧收敛速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号