首页> 外文会议>Workshop on stylistic variation 2017 >Modeling Communicative Purpose with Functional Style: Corpus and Features for German Genre and Register Analysis
【24h】

Modeling Communicative Purpose with Functional Style: Corpus and Features for German Genre and Register Analysis

机译:用功能风格建模交际目的:德语体裁和语种分析的语料库和功能

获取原文
获取原文并翻译 | 示例

摘要

While there is wide acknowledgement in NLP of the utility of document characterization by genre, it is quite difficult to determine a definitive set of features or even a comprehensive list of genres. This paper addresses both issues. First, with prototype semantics, we develop a hierarchical taxonomy of discourse functions. We implement the taxonomy by developing a new text genre corpus of contemporary German to perform a text based comparative register analysis. Second, we extract a host of style features, both deep and shallow, aiming beyond linguistically motivated features at situational correlates in texts. The feature sets are used for supervised text genre classification, on which our models achieve high accuracy. The combination of the corpus typology and feature sets allows us to characterize types of communicative purpose in a comparative setup, by qualitative interpretation of style feature loadings of a regularized discriminant analysis. Finally, to determine the dependence of genre on topics (which are arguably the distinguishing factor of sub-genre), we compare and combine our style models with Latent Dirichlet Allocation features across different corpus settings with unstable topics.
机译:尽管NLP广泛认可了按类型进行文档表征的效用,但要确定一组确定的功能甚至是一个完整的类型列表都非常困难。本文解决了这两个问题。首先,利用原型语义,我们开发了话语功能的分层分类法。我们通过开发当代德语的新文本体裁语料库来执行分类,以执行基于文本的比较注册分析。其次,我们提取了许多样式特征,包括深浅的样式特征,其目的是超越语言动机特征来定位文本中的情境关联。这些功能集用于监督文本体裁分类,我们的模型在这些分类上实现了较高的准确性。语料库类型学和特征集的组合使我们能够通过对正则判别分析的样式特征加载进行定性解释,从而在比较设置中表征交流目的的类型。最后,为了确定体裁对主题的依赖关系(可以说是子体裁的区别因素),我们将样式模型与具有不稳定主题的不同语料库设置中的潜在狄利克雷分配特征进行比较和组合。

著录项

  • 来源
  • 会议地点 Copenhagen(DK)
  • 作者

    Thomas Haider; Alexis Palmer;

  • 作者单位

    Max Planck Institute for Empirical Aesthetics Frankfurt am Main, Germany;

    University of North Texas Denton, Texas, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-26 14:23:35

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号