首页> 外文OA文献 >Towards easier and faster sequence labeling for natural language processing: A search-based probabilistic online learning framework (SAPO)
【2h】

Towards easier and faster sequence labeling for natural language processing: A search-based probabilistic online learning framework (SAPO)

机译:为了更轻松,更快的自然语言处理序列标记:基于搜索的概率在线学习框架(SAPO)

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

There are two major approaches for structured classification. One is theprobabilistic gradient-based methods such as conditional random fields (CRF),which has high accuracy but with drawbacks: slow training, and no support ofsearch-based optimization (which is important in many cases). The other one isthe search-based learning methods such as perceptrons and margin infusedrelaxed algorithm (MIRA), which have fast training but also with drawbacks: lowaccuracy, no probabilistic information, and non-convergence in real-worldtasks. We propose a novel and "shockingly easy" solution, a search-basedprobabilistic online learning method, to address most of those issues. Thismethod searches the output candidates, derives probabilities, and conductefficient online learning. We show that this method is with fast training,support search-based optimization, very easy to implement, with top accuracy,with probabilities, and with theoretical guarantees of convergence. Experimentson well-known tasks show that our method has better accuracy than CRF andalmost as fast training speed as perceptron and MIRA. Results also show thatSAPO can easily beat the state-of-the-art systems on those highly-competitivetasks, achieving record-breaking accuracies.
机译:结构化分类有两种主要方法。一个是基于级别的基于梯度的方法,如条件随机字段(CRF),具有高精度,但有缺点:慢速训练,并且不支持基于研究的优化(在许多情况下很重要)。另一个是基于搜索的学习方法,如Perceptrons和Margin Infusedrelaxed算法(Mira),其具有快速训练,但也具有缺点:低理解,无概率信息,以及实际世界的非收敛性。我们提出了一种小说和“令人震惊的简单”解决方案,基于搜索的在线学习方法,以解决大多数问题。 Thismethod搜索输出候选人,推导概率和在线学习。我们展示了这种方法,采用快速培训,支持基于搜索的优化,非常易于实现,顶部准确性,具有概率,并具有融合的理论保证。实验室众所周知的任务表明,我们的方法比CRF Andal在Perceptron和Mira的快速训练速度具有更好的准确性。结果还表明,据称可以轻松地击败最先进的系统,以实现高度竞争对手,实现录制折断的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号