Journal: Linguistic Issues in Language Technology

Kolmogorov complexity of morphs and constructions in English


Abstract

This chapter demonstrates how compression algorithms can be used to examine morphological and syntactic complexity in detail by analysing the contribution of specific linguistic features to English texts. The point of departure is the ongoing complexity debate and the quest for complexity metrics. After decades of adherence to the equal complexity axiom, recent research seeks to define and measure linguistic complexity (Dahl 2004; Kortmann and Szmrecsanyi 2012; Miestamo et al. 2008). Against this backdrop, I present a new flavour of the Juola-style compression technique (Juola 1998): targeted manipulation. Essentially, compression algorithms are used to measure linguistic complexity via the relative informativeness of text samples. I thus assess the contribution of morphs such as -ing or -ed, and of functional constructions such as the progressive (be + verb-ing) or the perfect (have + verb past participle), to syntactic and morphological complexity in a mixed-genre corpus comprising Alice's Adventures in Wonderland, the Gospel of Mark and newspaper texts. I find that a larger number of marker types leads to greater morphological complexity in the corpus. Syntactic complexity is reduced because the presence of morphological markers enhances the algorithmic prediction of linguistic patterns. To conclude, I show that information-theoretic methods yield linguistically meaningful results and can be used to measure the complexity of specific linguistic features in naturalistic corpora.
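The general idea of compression-based measurement with a targeted manipulation can be illustrated with a minimal sketch. It is not the chapter's implementation: zlib stands in for whichever compressor the study used, the regex-based suffix removal is a crude substitute for proper morphological analysis, and the function names (`compressed_size`, `compression_ratio`, `strip_suffix`, `marker_effect`) are hypothetical.

```python
import re
import zlib


def compressed_size(text: str) -> int:
    """Size in bytes of the zlib-compressed UTF-8 encoding of text."""
    return len(zlib.compress(text.encode("utf-8"), 9))


def compression_ratio(text: str) -> float:
    """Compressed size divided by raw size; lower means more predictable text."""
    raw = len(text.encode("utf-8"))
    return compressed_size(text) / raw if raw else 0.0


def strip_suffix(text: str, suffix: str = "ing") -> str:
    """Crudely delete a suffix from words whose remaining stem has 3+ letters.

    Assumption: a regex stands in for real morphological analysis, so it
    over-matches (for example, "morning" becomes "morn").
    """
    pattern = rf"\b(\w{{3,}}){re.escape(suffix)}\b"
    return re.sub(pattern, r"\1", text)


def marker_effect(text: str, suffix: str = "ing") -> float:
    """Change in compression ratio after the marker is removed from the text."""
    return compression_ratio(strip_suffix(text, suffix)) - compression_ratio(text)
```

Under these assumptions, calling `marker_effect(text, "ed")` on a text sample compares its compressibility with and without the -ed marker. How the chapter normalises such differences, and how it separates morphological from syntactic complexity, is not specified in the abstract and is not modelled here.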
