首页> 外文学位 >Automated annotation and visualization of rhetorical figures.
【24h】

Automated annotation and visualization of rhetorical figures.

机译:修辞格的自动注释和可视化。

获取原文
获取原文并翻译 | 示例

摘要

Linguistic annotation provides additional information asserted with a particular purpose in a document or other piece of information. It is widely used in various fields, from computing and bioinformatics, through imaging, to law and linguistics. There is also a clear distinction between what is communicated through the written/spoken natural language and how this is passed on. A new problem of linguistic annotation is the annotation of classical rhetorical figures---patterns of text in which a characteristic syntactic form modifies the standard meanings of words, and leads to a change or an extension of meaning. Rhetoric studies the effectiveness of language comprehensively, including its emotional impact, as much as its propositional content. The annotation of rhetorical figures is therefore important not only for the linguistic point of view, but also for discovering different styles of writing, purpose and effect of written documents, and for better natural language understanding in general.;Lastly, we present the visualization of the occurrences of the figures and comparison between 14 American presidents' inaugural addresses including the most recent one by President Barack Obama. The provocative results of this comparison show that (a) automated analysis of meaningful rhetorical information is possible and tractable, and (b) help us with understanding what creates a successful orator.;The purpose of this thesis is the automated annotation of rhetorical figures. In the thesis we primarily focus on the figures of repetition, which include the repetition of words, phrases, and clauses. Additionally, we also describe the work we have done on the detection and annotation of figures of parallelism, as well as those that pertain more to the semantics than to the syntax, or positioning. We have developed a rhetorical figure annotation tool dubbed JANTOR (Java ANnotation Tool Of Rhetoric), which enables manual and automated annotation of files in HTML format. We have applied a lexicalized probabilistic context-free grammar parser for the recognition of the figures of repetition. We also describe a simple parse tree distance used for calculating the difference between similarly structured phrases, which is necessary for the recognition of some of the figures of parallelism. Moreover, we have applied the semantic relationships contained in the WordNet lexical database and extended Porter stemmer algorithm for finding derivationally related words. Finally, we present a method for finding pairs of words which are ordinarily contradictory, which is crucial for detecting the interesting figure of speech: oxymoron . For this purpose typed dependency grammars together with WordNet are used. The experiments we have conducted on the detection of selected subset of rhetorical figures have yielded very promising results.
机译:语言注释提供了在文档中具有特定目的的其他信息或其他信息。它广泛应用于从计算和生物信息学到影像学到法律和语言学的各个领域。通过书面/口头自然语言传达的信息与传递方式之间也有明显的区别。语言注释的一个新问题是古典修辞格的注释-文本模式,其中一种特殊的句法形式修改了单词的标准含义,并导致含义的改变或扩展。修辞学全面研究语言的有效性,包括情感影响以及命题内容。因此,修辞格的注释不仅对于语言学观点很重要,而且对于发现不同的写作风格,书面文档的目的和效果以及总体上更好地理解自然语言也很重要。数字的发生和14位美国总统就职演说之间的比较,包括巴拉克·奥巴马总统最近发表的演说。比较结果的启发性结果表明:(a)对有意义的修辞信息进行自动分析是可能且容易处理的;(b)帮助我们理解是什么造就了成功的演说家。本论文的目的是对修辞人物进行自动注释。在本文中,我们主要关注重复的图形,包括单词,短语和从句的重复。此外,我们还描述了我们在检测和注释并行性图形方面所做的工作,以及与语义有关而不是与语法或位置有关的工作。我们开发了一种称为JANTOR(修辞的Java注释工具)的修辞图形注释工具,该工具可以对HTML格式的文件进行手动和自动注释。我们已经应用了词汇化的概率上下文无关文法解析器来识别重复的数字。我们还描述了一个简单的分析树距离,用于计算结构相似的短语之间的差异,这对于识别某些并行度图形是必需的。此外,我们已将WordNet词汇数据库中包含的语义关系和扩展的Porter stemmer算法应用于发现派生相关的单词。最后,我们提出一种查找通常矛盾的单词对的方法,这对于检测有趣的语音比喻是至关重要的:oxymoron。为此,使用了类型依赖性语法和WordNet。我们对选定的修辞格子集进行的实验产生了非常有希望的结果。

著录项

  • 作者

    Gawryjolek, Jakub J.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Language Linguistics.;Computer Science.
  • 学位 M.Math.
  • 年度 2009
  • 页码 118 p.
  • 总页数 118
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号