【24h】

Taylor's Law for Human Linguistic Sequences

机译:泰勒对人类语言序列的法律

获取原文

摘要

Taylor's law describes the fluctuation characteristics underlying a system in which the variance of an event within a time span grows by a power law with respect to the mean. Although Taylor's law has been applied in many natural and social systems, its application for language has been scarce. This article describes a new quantification of Taylor's law in natural language and reports an analysis of over 1100 texts across 14 languages. The Taylor exponents of written natural language texts were found to exhibit almost the same value. The exponent was also compared for other language-related data, such as the child-directed speech, music, and programming language code. The results show how the Taylor exponent serves to quantify the fundamental structural complexity underlying linguistic time series. The article also shows the applicability of these findings in evaluating language models.
机译:泰勒的法律描述了一个系统的波动特征,其中一个系统在时间范围内的事件的变化由电力法相对于平均值而增长。尽管泰勒的法律已应用于许多自然和社会系统,但其对语言的应用已经稀缺。本文介绍了在自然语言中的新量化,并报告了14种语言的1100多个文本的分析。发现书面自然语言文本的泰勒指数表现出几乎相同的价值。还与其他语言相关数据进行比较,例如儿童定向的语音,音乐和编程语言代码。结果表明泰勒指数如何用于量化语言时间序列的基本结构复杂性。本文还显示了这些发现在评估语言模型中的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号