JMLR: Workshop and Conference Proceedings

Minimax Optimal Bayes Mixtures for Memoryless Sources over Large Alphabets



Abstract

The normalized maximum likelihood (NML) distribution achieves the minimax log loss and coding regret for the multinomial model. In practice, other nearly minimax distributions are used instead, since calculating the sequential probabilities needed for coding and prediction takes exponential time under NML. The Bayes mixture obtained with the Dirichlet prior $\operatorname{Dir}(1/2, \ldots, 1/2)$ and asymptotically minimax modifications of it have been widely studied in the context of large sample sizes. Recently there has also been interest in minimax optimal coding distributions for large alphabets. We investigate Dirichlet priors that achieve minimax coding regret when the alphabet size $m$ is finite but large in comparison to the sample size $n$. We prove that a Bayes mixture with the Dirichlet prior $\operatorname{Dir}(1/3, \ldots, 1/3)$ is optimal in this regime (in particular, when $m > \frac{5}{2} n + \frac{4}{n - 2} + \frac{3}{2}$). The worst-case regret of the resulting distribution approaches the NML regret as the alphabet size grows.
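What makes such Bayes mixtures practical is that, by Dirichlet-multinomial conjugacy, their sequential probabilities have a closed form: under a symmetric prior $\operatorname{Dir}(\alpha, \ldots, \alpha)$ over an $m$-symbol alphabet, the predictive probability of symbol $j$ after $t$ observations is $(c_j + \alpha)/(t + m\alpha)$, where $c_j$ counts the past occurrences of $j$. The minimal Python sketch below (the function names are illustrative, not from the paper) computes the resulting code length, the pointwise regret against the maximum-likelihood distribution, and the abstract's sufficient condition for $\operatorname{Dir}(1/3, \ldots, 1/3)$ optimality.

```python
import math

def mixture_codelength(sequence, m, alpha=1/3):
    """Code length (nats) of `sequence` (symbols 0..m-1) under the Bayes
    mixture with a symmetric Dirichlet(alpha, ..., alpha) prior, via the
    closed-form predictive rule
        P(x_{t+1} = j | x_1, ..., x_t) = (c_j + alpha) / (t + m * alpha).
    Each step is O(1), unlike sequential prediction with NML."""
    counts = [0] * m
    nats = 0.0
    for t, j in enumerate(sequence):
        nats -= math.log((counts[j] + alpha) / (t + m * alpha))
        counts[j] += 1
    return nats

def pointwise_regret(sequence, m, alpha=1/3):
    """Mixture code length minus the maximum-likelihood code length
    -sum_j c_j * log(c_j / n); its worst case over sequences of length n
    is the regret the paper compares to the NML regret."""
    n = len(sequence)
    counts = [0] * m
    for j in sequence:
        counts[j] += 1
    ml_nats = -sum(c * math.log(c / n) for c in counts if c > 0)
    return mixture_codelength(sequence, m, alpha) - ml_nats

def dir_one_third_regime(m, n):
    """Sufficient condition from the abstract (stated for n > 2) under
    which Dir(1/3, ..., 1/3) achieves the minimax regret:
        m > (5/2) n + 4 / (n - 2) + 3/2."""
    return n > 2 and m > 2.5 * n + 4 / (n - 2) + 1.5
```

For $n = 10$, for instance, the condition reads $m > 27$, so the $\operatorname{Dir}(1/3, \ldots, 1/3)$ mixture is minimax optimal for alphabets of 28 or more symbols.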
