【24h】

Taylor's Law for Human Linguistic Sequences

机译:泰勒人类语言序列定律

获取原文

摘要

Taylor's law describes the fluctuation characteristics underlying a system in which the variance of an event within a time span grows by a power law with respect to the mean. Although Taylor's law has been applied in many natural and social systems, its application for language has been scarce. This article describes a new quantification of Taylor's law in natural language and reports an analysis of over 1100 texts across 14 languages. The Taylor exponents of written natural language texts were found to exhibit almost the same value. The exponent was also compared for other language-related data, such as the child-directed speech, music, and programming language code. The results show how the Taylor exponent serves to quantify the fundamental structural complexity underlying linguistic time series. The article also shows the applicability of these findings in evaluating language models.
机译:泰勒定律描述了系统的波动特征,在该系统中,时间范围内事件的方差相对于均值通过幂定律增长。尽管泰勒定律已在许多自然和社会系统中得到应用,但对语言的应用却很少。本文介绍了自然语言中泰勒定律的新量化方法,并报告了对14种语言中1100多种文本的分析。人们发现自然语言文字的泰勒指数表现出几乎相同的价值。还将该指数与其他与语言相关的数据进行了比较,例如面向儿童的语音,音乐和编程语言代码。结果表明,泰勒指数是如何量化语言时间序列背后的基本结构复杂性的。本文还显示了这些发现在评估语言模型中的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号