Journal of Machine Learning Research

Maximum Entropy Discrimination Markov Networks



Abstract

The standard maximum margin approach for structured prediction lacks a straightforward probabilistic interpretation of the learning scheme and the prediction rule. Therefore its unique advantages, such as dual sparseness and kernel tricks, cannot be easily conjoined with the merits of a probabilistic model such as Bayesian regularization, model averaging, and the ability to model hidden variables. In this paper, we present a new general framework called maximum entropy discrimination Markov networks (MaxEnDNet, or simply MEDN), which integrates these two approaches and combines and extends their merits. Major innovations of this approach include: 1) It extends the conventional max-entropy discrimination learning of classification rules to a new structural max-entropy discrimination paradigm of learning a distribution of Markov networks. 2) It generalizes the extant Markov network structured-prediction rule based on a point estimator of model coefficients to an averaging model, akin to a Bayesian predictor, that integrates over a learned posterior distribution of model coefficients. 3) It admits flexible entropic regularization of the model during learning. By plugging in different prior distributions of the model coefficients, it subsumes the well-known maximum margin Markov networks (M3N) as a special case, and leads to a model similar to an L1-regularized M3N that is simultaneously primal and dual sparse, or other new types of Markov networks. 4) It applies a modular learning algorithm that combines existing variational inference techniques and convex-optimization-based M3N solvers as subroutines. Essentially, MEDN can be understood as a jointly maximum likelihood and maximum margin estimate of a Markov network.
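In the notation standard for maximum entropy discrimination (joint feature function F(x, y; w), prior p_0(w), slack variables ξ_i, and structured loss Δℓ_i(y)), the learning problem behind this joint estimate can be sketched as a KL-regularized program with expected-margin constraints; this is a sketch in the conventional MED notation, not a verbatim statement of the paper's formulation:

```latex
\min_{p(\mathbf{w}),\,\boldsymbol{\xi}} \;\;
\mathrm{KL}\!\left(p(\mathbf{w}) \,\|\, p_0(\mathbf{w})\right) + U(\boldsymbol{\xi})
\quad \text{s.t.} \quad
\int p(\mathbf{w})\,\big[\Delta F_i(\mathbf{y};\mathbf{w}) - \Delta\ell_i(\mathbf{y})\big]\, d\mathbf{w}
\;\ge\; -\xi_i,
\;\; \forall i,\; \forall \mathbf{y} \neq \mathbf{y}^i
```

where ΔF_i(y; w) = F(x^i, y^i; w) − F(x^i, y; w) is the margin of the true labeling y^i over a competitor y, and U(ξ) penalizes slack. Plugging in a Gaussian p_0 recovers M3N as a special case, while a Laplace p_0 yields the simultaneously primal- and dual-sparse variant described above.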
It represents the first successful attempt to combine maximum entropy learning (a dual form of maximum likelihood learning) with maximum margin learning of Markov networks for structured input/output problems; and the basic principle can be generalized to learning arbitrary graphical models, such as generative Bayesian networks or models with structured hidden variables. We discuss a number of theoretical properties of this approach, and show that empirically it outperforms a wide array of competing methods for structured input/output learning on both synthetic and real OCR and web data extraction data sets.
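As a toy illustration of the averaging prediction rule (innovation 2 above), the sketch below uses hypothetical names of our own (`feat`, `mu`, `Sigma` are illustrative, not the paper's code) to check numerically that under a Gaussian posterior over w, the averaged score E_{p(w)}[w · f(x, y)] coincides with scoring at the posterior mean, since the expectation is linear in w:

```python
import numpy as np

# Toy 3-class problem: joint feature map f(x, y) places x in the block for label y.
def feat(x, y):
    phi = np.zeros(3 * x.size)
    phi[y * x.size:(y + 1) * x.size] = x
    return phi

rng = np.random.default_rng(0)
x = rng.normal(size=4)
mu = rng.normal(size=12)        # hypothetical posterior mean of the coefficients w
Sigma = 0.1 * np.eye(12)        # hypothetical posterior covariance
labels = [0, 1, 2]

# Monte Carlo estimate of the averaged score E_{p(w)}[w . f(x, y)] ...
samples = rng.multivariate_normal(mu, Sigma, size=20000)
mc_scores = [samples @ feat(x, y) for y in labels]
avg_pred = int(np.argmax([s.mean() for s in mc_scores]))

# ... versus the closed form mu . f(x, y): the two prediction rules agree,
# because the expectation of a linear score is the score at the mean.
mean_pred = int(np.argmax([mu @ feat(x, y) for y in labels]))
print(avg_pred, mean_pred)
```

The point of the sketch is that for a linear score the Bayesian-style averaging predictor adds nothing beyond the posterior mean; the averaging rule matters when the posterior interacts nonlinearly with prediction, e.g. under sparsity-inducing priors or hidden variables.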


