首页> 外文会议>Conference on Empirical Methods in Natural Language Processing >Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n~6) down to O(n~3)
【24h】

Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n~6) down to O(n~3)

机译:基于跨度的不连续选区解析:一个基于图表的一系列基于图表的算法,与O(n〜6)到O(n〜3)的时间复杂性

获取原文

摘要

We introduce a novel chart-based algorithm for span-based parsing of discontinuous constituency trees of block degree two, including ill-nested structures. In particular, we show that we can build variants of our parser with smaller search spaces and time complexities ranging from O(n~6) down to O(n~3). The cubic time variant covers 98% of constituents observed in linguistic treebanks while having the same complexity as continuous constituency parsers. We evaluate our approach on German and English treebanks (Negra, Tiger, and DPTB) and report state-of-the-art results in the fully supervised setting. We also experiment with pre-trained word embeddings and Bert-based neural networks.
机译:我们介绍了一种基于跨度的跨度的基于跨度的跨度选区树的基于图表的算法,包括嵌套的结构。特别是,我们表明我们可以使用较小的搜索空间和从O(n〜6)到O(n〜3)的时间复杂性构建与较小的搜索空间和时间复杂性的变体。立方时间变体覆盖在语言树木间观察到的98%的成分,同时具有与连续组件解析器相同的复杂性。我们评估我们在德语和英语树木库(Negra,Tiger和DPTB)上的方法,并在完全监督的环境中报告最先进的结果。我们还试验预先训练的单词嵌入和基于伯特的神经网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号