Conference proceedings: Computational Linguistics and Intelligent Text Processing

Generating Natural Word Orders in a Semi-free Word Order Language: Treebank-Based Linearization Preferences for German


Abstract

We outline an algorithm capable of generating varied but natural-sounding sequences of argument NPs in subordinate clauses of German, a semi-free word order language. To attain the right level of output flexibility, the algorithm considers (1) the relevant lexical properties of the head verb (not only its transitivity type but also reflexivity, the thematic relations expressed by the NPs, etc.), and (2) the animacy, definiteness, and length of the arguments. The relevant statistical data were extracted from the NEGRA-II treebank and from hand-coded features for animacy and definiteness. The algorithm maps these properties onto "primary" versus "secondary" placement options in the generator. It is restricted in that it does not take into account determinants of linear order related to the sentence's information structure and its discourse context (e.g. contrastiveness). These factors may modulate the above preferences or license "tertiary" linear orders beyond the primary and secondary options considered here.
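To make the idea of "primary" versus "secondary" placement options concrete, here is a minimal Python sketch. It is not the authors' algorithm: the class `ArgumentNP`, the functions `precedence_score`, `order_penalty` and `placement_options`, the default role ranking, and all numeric weights are hypothetical and chosen only for illustration. The abstract says the real preferences were derived from NEGRA-II statistics and also depend on lexical properties of the head verb, which this sketch leaves out.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass(frozen=True)
class ArgumentNP:
    """One argument NP with the features named in the abstract (hypothetical encoding)."""
    text: str
    role: str        # "SUBJ", "IOBJ" or "DOBJ"
    animate: bool
    definite: bool

    @property
    def length(self) -> int:
        # Length measured in whitespace-separated tokens.
        return len(self.text.split())


# Assumed default ranking of grammatical functions; not taken from the paper.
ROLE_RANK = {"SUBJ": 0, "IOBJ": 1, "DOBJ": 2}


def precedence_score(np: ArgumentNP) -> float:
    """Lower score = preferred earlier. All weights are invented for illustration."""
    score = float(ROLE_RANK[np.role])
    if np.animate:
        score -= 0.6          # animate NPs drift leftwards
    if np.definite:
        score -= 0.3          # definite NPs drift leftwards
    score += 0.1 * np.length  # longer NPs drift rightwards
    return score


def order_penalty(order) -> float:
    """Sum over all pairs in which a less preferred NP precedes a more preferred one."""
    penalty = 0.0
    for i, first in enumerate(order):
        for later in order[i + 1:]:
            diff = precedence_score(first) - precedence_score(later)
            if diff > 0:
                penalty += diff
    return penalty


def placement_options(nps):
    """Return (primary, secondary): the two orders with the lowest penalty."""
    ranked = sorted(permutations(nps), key=order_penalty)
    primary = ranked[0]
    secondary = ranked[1] if len(ranked) > 1 else None
    return primary, secondary


if __name__ == "__main__":
    # Arguments of the subordinate clause "(dass) der Lehrer dem Kind ein altes Buch gab".
    nps = [
        ArgumentNP("ein altes Buch", "DOBJ", animate=False, definite=False),
        ArgumentNP("der Lehrer", "SUBJ", animate=True, definite=True),
        ArgumentNP("dem Kind", "IOBJ", animate=True, definite=True),
    ]
    primary, secondary = placement_options(nps)
    print("primary:  ", " ".join(np.text for np in primary))
    print("secondary:", " ".join(np.text for np in secondary))
```

A generator built along these lines could emit the primary order by default and fall back to the secondary one for variation. Orders motivated by information structure, the "tertiary" options mentioned above, are outside the scope of this sketch, as they are outside the scope of the algorithm described.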
