Venue: International Joint Conference on Natural Language Processing
Split or Merge: Which is Better for Unsupervised RST Parsing?



Abstract

Rhetorical Structure Theory (RST) parsing is crucial for many downstream NLP tasks that require a discourse structure for a text. Most previous RST parsers have been based on supervised learning approaches. That is, they require an annotated corpus of sufficient size and quality, and they depend heavily on the language and domain of that corpus. In this paper, we present two language-independent unsupervised RST parsing methods based on dynamic programming. The first builds the optimal tree in terms of a dissimilarity score function defined for splitting a text span into smaller ones. The second builds the optimal tree in terms of a similarity score function defined for merging two adjacent spans into a larger one. Experimental results on English and German RST treebanks showed that our parser based on span merging achieved the best score, around 0.8 F_1, which is close to the scores of previous supervised parsers.
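The span-merging method described in the abstract can be illustrated with a minimal CKY-style dynamic program: over a sequence of elementary discourse units (EDUs), it builds the binary tree that maximizes the total similarity of every pair of adjacent spans merged. This is only a sketch of the general technique, not the paper's implementation; the `sim` function is a hypothetical placeholder for the paper's similarity score function.

```python
# Minimal CKY-style DP sketch of span merging (not the paper's actual code).
# `sim((i, k), (k + 1, j))` is a hypothetical stand-in for the similarity
# score function; the real parser defines this from the text itself.

def best_merge_tree(edus, sim):
    n = len(edus)
    # score[i][j]: best total similarity achievable for the span edus[i..j]
    score = [[0.0] * n for _ in range(n)]
    # split[i][j]: split point k that achieves score[i][j]
    split = [[None] * n for _ in range(n)]
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            best = None
            for k in range(i, j):
                s = score[i][k] + score[k + 1][j] + sim((i, k), (k + 1, j))
                if best is None or s > best:
                    best, split[i][j] = s, k
            score[i][j] = best
    # Recover the binary tree from the recorded split points.
    def tree(i, j):
        if i == j:
            return edus[i]
        k = split[i][j]
        return (tree(i, k), tree(k + 1, j))
    return tree(0, n - 1), score[0][n - 1]
```

The span-splitting method is the mirror image of this recurrence: it starts from the whole text and chooses the split point that maximizes a dissimilarity score between the two resulting sub-spans.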
