首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >The Infinite-Order Conditional Random Field Model for Sequential Data Modeling
【24h】

The Infinite-Order Conditional Random Field Model for Sequential Data Modeling

机译:顺序数据建模的无序条件随机场模型

获取原文
获取原文并翻译 | 示例
       

摘要

Sequential data labeling is a fundamental task in machine learning applications, with speech and natural language processing, activity recognition in video sequences, and biomedical data analysis being characteristic examples, to name just a few. The conditional random field (CRF), a log-linear model representing the conditional distribution of the observation labels, is one of the most successful approaches for sequential data labeling and classification, and has lately received significant attention in machine learning as it achieves superb prediction performance in a variety of scenarios. Nevertheless, existing CRF formulations can capture only one- or few-timestep interactions and neglect higher order dependences, which are potentially useful in many real-life sequential data modeling applications. To resolve these issues, in this paper we introduce a novel CRF formulation, based on the postulation of an energy function which entails infinitely long time-dependences between the modeled data. Building blocks of our novel approach are: 1) the sequence memoizer (SM), a recently proposed nonparametric Bayesian approach for modeling label sequences with infinitely long time dependences, and 2) a mean-field-like approximation of the model marginal likelihood, which allows for the derivation of computationally efficient inference algorithms for our model. The efficacy of the so-obtained infinite-order CRF ($({rm CRF}^{infty })$) model is experimentally demonstrated.
机译:顺序数据标记是机器学习应用程序中的一项基本任务,语音和自然语言处理,视频序列中的活动识别以及生物医学数据分析就是典型示例,仅举几例。条件随机场(CRF)是表示观察标签的条件分布的对数线性模型,是用于顺序数据标签和分类的最成功方法之一,并且由于它实现了出色的预测,最近在机器学习中受到了广泛关注在各种情况下的性能。然而,现有的CRF公式只能捕获一个或几个时步的交互作用,而忽略了较高阶的依赖性,这在许多现实生活中的顺序数据建模应用程序中可能很有用。为了解决这些问题,在本文中,我们基于能量函数的假设引入了一种新的CRF公式,该函数在建模数据之间需要无限长的时间依赖性。我们新颖方法的基础是:1)序列记忆器(SM),这是最近提出的用于对具有无限长时间依赖性的标签序列进行建模的非参数贝叶斯方法,以及2)模型边际似然的均值场近似。允许为我们的模型推导计算效率高的推理算法。实验证明了如此获得的无限阶CRF($({rm CRF} ^ {infty})$)模型的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号