首页> 外文会议>International conference on text, speech and dialogue >An Experiment with Theme-Rheme Identification
【24h】

An Experiment with Theme-Rheme Identification

机译:主题韵识别实验

获取原文

摘要

In this paper we start from the theory of Functional Sentence Perspective developed primarily by Firbas, Svoboda and also later by Sgall et al.. We make an attempt to formulate and implement a procedure for Czech allowing to automatically recognize which sentence constituents carry information that is contextually dependent and thus known to an addressee (theme), constituents containing new information (rheme), and also constituents bearing non-thematic and non-rhematic information (transition). The experimental implementation of the procedure uses tools developed in NLP Centre, FI MU, particularly the morphological analyzer Majka, disambiguator DESAMB and parser SET. As a starting data resource we use a small corpus of 120 Czech sentences, which at the moment does not include a free continuous text. This is motivated by the fact that we do not use syntactically pre-tagged text but perform syntactic analysis directly using the parser SET. Thus, we offer only a very basic evaluation, which captures the main FSP phenomena and shows that the task is feasible. The toolset developed for the experiment consists of two parts: first, a chunker, which determines word-order positions from the parse tree of a sentence, second, an FSP tagger which is the implementation of the procedure. It labels the chunks with the tags of what is further called functional elements (e.g. theme proper, transition, rheme proper).
机译:在本文中,我们从主要由Firbas,Svoboda和后来由Sgall等人开发的功能句透视理论开始。我们尝试制定和实现捷克语的程序,该程序可以自动识别哪些句子成分携带的信息是上下文相关,因此对于收件人(主题)而言,包含新信息的成分(类目),以及带有非主题和非流变信息的类目(过渡)。该程序的实验性实施使用了FI MU NLP中心开发的工具,尤其是形态分析仪Majka,消歧器DESAMB和解析器SET。作为起始数据资源,我们使用120个捷克语句子的小型语料库,目前该语料库不包含自由的连续文本。这是因为我们不使用语法上预先标记的文本,而是直接使用解析器SET执行语法分析。因此,我们仅提供一个非常基本的评估,该评估可以捕获FSP的主要现象并表明该任务是可行的。为该实验开发的工具集包括两个部分:第一,一个分块器,它从句子的分析树确定单词顺序的位置,第二,一个FSP标记器,它是该过程的实现。它使用进一步称为功能元素的标签(例如,主题适当,过渡,音韵适当)的标签来标记这些块。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号