首页> 外文会议>International Conference on Machine Learning >Maximum Likelihood Estimation for Learning Populations of Parameters
【24h】

Maximum Likelihood Estimation for Learning Populations of Parameters

机译:参数学习群体的最大似然估计

获取原文

摘要

Consider a setting with N independent individuals, each with an unknown parameter, p_i ∈ [0, 1] drawn from some unknown distribution P*. After observing the outcomes of t independent Bernoulli trials, i.e., X_i ~ Binomial(t, p_i) per individual, our objective is to accurately estimate P*. This problem arises in numerous domains, including the social sciences, psychology, healthcare, and biology, where the size of the population under study is usually large while the number of observations per individual is often limited. Our main result shows that, in the regime where t N, the maximum likelihood estimator (MLE) is both statistically minimax optimal and efficiently computable. Precisely, for sufficiently large N, the MLE achieves the information theoretic optimal error bound of O(1/t) for t < c log N, with regards to the earth mover's distance (between the estimated and true distributions). More generally, in an exponentially large interval of t beyond c log N, the MLE achieves the minimax error bound of O(1/(√t log N)). In contrast, regardless of how large N is, the naive "plug-in" estimator for this problem only achieves the sub-optimal error of Θ(1/(√t)).
机译:考虑使用n个独立个人的设置,每个都有一个未知的参数,p_i∈[0,1]从一些未知的分发p *中汲取。在观察T独立Bernoulli试验的结果之后,即每个人的X_I〜二项式(T,P_I),我们的目标是准确估计P *。这个问题出现在众多领域,包括社会科学,心理学,医疗保健和生物学,其中群体的规模通常很大,而每个人的观察人数通常是有限的。我们的主要结果表明,在T N的制度中,最大似然估计器(MLE)的最佳差距和有效可计算。精确地,对于足够大的N,MLE在地球移动器的距离(估计和真正的分布之间)方面,MLE实现了O(1 / T)的信息理论最佳误差为O(1 / T)。更一般地,在C log n的指数上间隔中,MLE实现了O(1 /(√tlog n))的最小误差。相反,无论n是多大,对于该问题的Naive“插入”估计器仅实现θ(1 /(√T))的次优误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号