【24h】

On the origin of long-range correlations in texts

机译:关于文本中远程关联的起源

获取原文
获取原文并翻译 | 示例
       

摘要

The complexity of human interactions with social and natural phenomena is mirrored in the way we describe our experiences through natural language. In order to retain and convey such a high dimensional information, the statistical properties of our linguistic output has to be highly correlated in time. An example are the robust observations, still largely not understood, of correlations on arbitrary long scales in literary texts. In this paper we explain how long-range correlations flow from highly structured linguistic levels down to the building blocks of a text (words, letters, etc.). By combining calculations and data analysis we show that correlations take form of a bursty sequence of events once we approach the semantically relevant topics of the text. The mechanisms we identify are fairly general and can be equally applied to other hierarchical settings.
机译:我们通过自然语言描述我们的经历的方式反映了人类与社会和自然现象互动的复杂性。为了保留和传达如此高维度的信息,我们的语言输出的统计属性必须在时间上高度相关。一个很好的例子是文学文本中任意长尺度上的相关性的鲁棒性观察,但在很大程度上仍未被理解。在本文中,我们将解释长期的关联如何从高度结构化的语言层次一直流到文本的构建基块(单词,字母等)。通过结合计算和数据分析,我们表明,一旦我们接近文本的语义相关主题,相关性就会以一系列突发事件的形式出现。我们确定的机制相当笼统,可以同等地应用于其他层次设置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号