...
Journal of Machine Learning Research

Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks


Abstract

Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of parameters in these models is therefore an important problem, and becomes a key factor when learning from very large data sets. This paper describes exponentiated gradient (EG) algorithms for training such models, where EG updates are applied to the convex dual of either the log-linear or max-margin objective function; the dual in both the log-linear and max-margin cases corresponds to minimizing a convex function with simplex constraints. We study both batch and online variants of the algorithm, and provide rates of convergence for both cases. In the max-margin case, O(1/ε) EG updates are required to reach a given accuracy ε in the dual; in contrast, for log-linear models only O(log(1/ε)) updates are required. For both the max-margin and log-linear cases, our bounds suggest that the online EG algorithm requires a factor of n less computation to reach a desired accuracy than the batch EG algorithm, where n is the number of training examples. Our experiments confirm that the online algorithms are much faster than the batch algorithms in practice. We describe how the EG updates factor in a convenient way for structured prediction problems, allowing the algorithms to be efficiently applied to problems such as sequence learning or natural language parsing. We perform extensive evaluation of the algorithms, comparing them to L-BFGS and stochastic gradient descent for log-linear models, and to SVM-Struct for max-margin models. The algorithms are applied to a multi-class problem as well as to a more complex large-scale parsing task. In all these settings, the EG algorithms presented here outperform the other methods.
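The primitive shared by both duals is a multiplicative update on the probability simplex: each coordinate is scaled by the exponential of its (negative, scaled) gradient and the result is renormalized, so the simplex constraints are maintained automatically. Below is a minimal sketch of this generic EG step in NumPy, applied to a toy quadratic objective rather than the paper's log-linear or max-margin duals; the function name eg_update, the step size eta, and the toy target are illustrative assumptions, not the paper's notation.

```python
import numpy as np

def eg_update(alpha, grad, eta):
    """One exponentiated gradient step on the probability simplex.

    alpha : current point on the simplex (non-negative, sums to 1)
    grad  : gradient of the objective at alpha
    eta   : step size (learning rate)
    """
    # Multiplicative update followed by renormalization keeps
    # the iterate on the simplex without any projection step.
    w = alpha * np.exp(-eta * grad)
    return w / w.sum()

# Toy usage: minimize f(alpha) = 0.5 * ||alpha - target||^2 over the simplex.
# (Illustrative only; the paper's duals are the log-linear and max-margin objectives.)
target = np.array([0.7, 0.2, 0.1])
alpha = np.ones(3) / 3            # start at the uniform distribution
for _ in range(200):
    grad = alpha - target         # gradient of the toy quadratic
    alpha = eg_update(alpha, grad, eta=0.5)
print(alpha)                      # converges toward target
```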
