Conference on Intelligent Text Processing and Computational Linguistics; CICLing 2014

An Investigation on the Influence of Genres and Textual Organisation on the Use of Discourse Relations



Abstract

In this paper, we investigate some of the problems associated with the automatic extraction of discourse relations. In particular, we study the influence of the communicative goals encoded in one genre compared with another, and of the various communicative goals encoded in different sections of documents of the same genre. Earlier work has examined differences across either genres or textual organization, but none has provided a thorough statistical analysis of these differences across currently available annotated corpora. We show that both the communicative goal of a given genre and, to a lesser extent, that of a particular topic tackled by that genre do in fact influence the distribution of discourse relations. Using a statistically grounded approach, we show that certain discourse relations are more likely to appear within given genres, and in turn within particular sections of a genre. In particular, we observed that Attribution relations are common in newspaper articles, while Joint relations are comparatively more frequent in online reviews. We also note that Temporal relations are statistically more common in the methodology sections of scientific research documents than in the rest of the text. These results are important because they offer clues for tailoring current discourse taggers to specific textual genres.
