【24h】

RST Parsing from Scratch

机译:从头开始解析

获取原文

摘要

We introduce a novel top-down end-to-end formulation of document level discourse parsing in the Rhetorical Structure Theory (RST) framework. In this formulation, we consider discourse parsing as a sequence of splitting decisions at token boundaries and use a seq2seq network to model the splitting decisions. Our framework facilitates discourse parsing from scratch without requiring discourse segmentation as a prerequisite; rather, it yields segmentation as part of the parsing process. Our unified parsing model adopts a beam search to decode the best tree structure by searching through a space of high scoring trees. With extensive experiments on the standard English RST discourse treebank, we demonstrate that our parser outperforms existing methods by a good margin in both end-to-end parsing and parsing with gold segmentation. More importantly, it does so without using any handcrafted features, making it faster and easily adaptable to new languages and domains.
机译:我们介绍了在修辞结构理论(RST)框架中的文献水平话语解析的新型自上而下的端到端制定。 在这种制定中,我们将话语解析为令牌边界处的拆分决策顺序,并使用SEQ2Seq网络来模拟分割决策。 我们的框架有助于从头划痕进行话语,而无需话语细分作为先决条件; 相反,它将分割作为解析过程的一部分产生分割。 我们统一的解析模型采用光束搜索来通过搜索高分树木的空间来解码最佳树结构。 通过对标准英语RST话语TreeBank进行了广泛的实验,我们证明我们的解析器在端到端解析和与金分割中解析的良好保证金以良好的余量优于现有方法。 更重要的是,它确实如此,而不使用任何手工制作功能,使其更快,易于适应新的语言和域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号