首页> 外文会议>International Conference on Text, Speech and Dialogue >An Experiment with Theme-Rheme Identification
【24h】

An Experiment with Theme-Rheme Identification

机译:主题res识别的实验

获取原文

摘要

In this paper we start from the theory of Functional Sentence Perspective developed primarily by Firbas [1], Svoboda [12] and also later by Sgall et al. [9]. We make an attempt to formulate and implement a procedure for Czech allowing to automatically recognize which sentence constituents carry information that is contextually dependent and thus known to an addressee (theme), constituents containing new information (rheme), and also constituents bearing non-thematic and non-rhematic information (transition). The experimental implementation of the procedure uses tools developed in NLP Centre, FI MU, particularly the morphological analyzer Majka [17], disambiguator DESAMB [16] and parser SET [5]. As a starting data resource we use a small corpus of 120 Czech sentences, which at the moment does not include a free continuous text. This is motivated by the fact that we do not use syntactically pre-tagged text but perform syntactic analysis directly using the parser SET. Thus, we offer only a very basic evaluation, which captures the main FSP phenomena and shows that the task is feasible. The toolset developed for the experiment consists of two parts: first, a chunker, which determines word-order positions from the parse tree of a sentence, second, an FSP tagger which is the implementation of the procedure. It labels the chunks with the tags of what is further called functional elements (e.g. theme proper, transition, rheme proper). An experimental version is available at http://nlp.fi.muni.cz/-xsvobo15/fsp/fsp.html.
机译:在本文中,我们从主要由FiRBAS [1],SVOBODA [12]的功能句子角度从函数句子透视理论开始,而且还通过SGALL等人。 [9]。我们试图制定和实施捷克语的程序,允许自动识别哪些句子成员携带上下文依赖的判决信息,因此已知的收件人(主题),包含新信息(Reme)的成分,以及构成非专题的组成部分和非现兵信息(转换)。该程序的实验实施是在NLP中心开发的工具,Fi Mu,特别是形态学分析仪Majka [17],消歧员Desamb [16]和解析器组[5]。作为一个起始数据资源,我们使用120捷克语句的小语料库,目前不包括免费连续文本。这是由于我们不使用句法预标记的文本但直接使用解析器集执行语法分析的事实。因此,我们只提供非常基本的评估,捕获了主要的FSP现象,并表明任务是可行的。为实验开发的工具集由两部分组成:第一,一个块,该块是从句子的解析树中确定单词阶位置,第二个是一个是过程的FSP标签。它用进一步称为功能元素的标签来标记块(例如主题,转换,reme适当)。在http://nlp.fi.muni.cz/-xsvobo15/fsp/fsp.html上提供了一个实验版本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号