...
首页> 外文期刊>Physical review, E >Zipf and Heaps laws from dependency structures in component systems
【24h】

Zipf and Heaps laws from dependency structures in component systems

机译:ZIPF和堆法来自组件系统中的依赖结构

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Complex natural and technological systems can be considered, on a coarse-grained level, as assemblies of elementary components: for example, genomes as sets of genes or texts as sets of words. On one hand, the joint occurrence of components emerges from architectural and specific constraints in such systems. On the other hand, general regularities may unify different systems, such as the broadly studied Zipf and Heaps laws, respectively concerning the distribution of component frequencies and their number as a function of system size. Dependency structures (i.e., directed networks encoding the dependency relations between the components in a system) were proposed recently as a possible organizing principles underlying some of the regularities observed. However, the consequences of this assumption were explored only in binary component systems, where solely the presence or absence of components is considered, and multiple copies of the same component are not allowed. Here we consider a simple model that generates, from a given ensemble of dependency structures, a statistical ensemble of sets of components, allowing for components to appear with any multiplicity.Our model is a minimal extension that is memoryless and therefore accessible to analytical calculations. A mean-field analytical approach (analogous to the "Zipfian ensemble" in the linguistics literature) captures the relevant laws describing the component statistics as we show by comparison with numerical computations. In particular, we recover a power-law Zipf rank plot, with a set of core components, and a Heaps law displaying three consecutive regimes (linear, sublinear, and saturating) that we characterize quantitatively.
机译:可以考虑复杂的自然和技术系统,在粗粒水平上,作为基本组分的组件:例如,作为基因组或文本组的基因组作为单词。一方面,在这种系统中的建筑和特定约束中出现了组件的联合发生。另一方面,一般规律性可以统一不同的系统,例如广泛研究的ZIPF和堆法,分别是关于元件频率的分布及其作为系统尺寸的函数的数量。最近提出了依赖结构(即,编码系统中的组件之间的依赖关系的定向网络)作为可能的一些规律性地区的组织原则。然而,仅在二元组分系统中探讨了这种假设的后果,其中仅考虑存在组分的存在或不存在,并且不允许使用同一组分的多个副本。在这里,我们考虑一个简单的模型,从依赖结构的给定集合,一个组件集的统计集合,允许组件以任何多重性出现.OUR模型是无记忆的最小扩展,因此可以访问分析计算。一种平均场地分析方法(类似于语言学文献中的“Zipfian集合”)捕获了通过与数值计算的比较来描述组件统计数据的相关法律。特别是,我们恢复了一组核心组件的电力范围Zipf秩图,以及堆法,显示我们定量表征的三个连续制度(线性,乘以和饱和)。

著录项

  • 来源
    《Physical review, E》 |2018年第1期|共9页
  • 作者单位

    Dipartimento di Fisica and INFN Università degli Studi di Torino Via Pietro Giuria 1 10125 Torino Italy;

    Santa Fe Institute 1399 Hyde Park Road Santa Fe New Mexico 87501 USA;

    Sorbonne Universités UPMC Univ Paris 06 UMR 7238 Computational and Quantitative Biology 4 Place Jussieu Paris France;

    Dipartimento di Fisica and INFN Università degli Studi di Torino Via Pietro Giuria 1 10125 Torino Italy;

    Sorbonne Universités UPMC Univ Paris 06 UMR 7238 Computational and Quantitative Biology 4 Place Jussieu Paris France;

    Sorbonne Universités UPMC Univ Paris 06 UMR 7238 Computational and Quantitative Biology 4 Place Jussieu Paris France;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 统计物理学;等离子体物理学;流体力学;
  • 关键词

    Zipf; Heaps; laws;

    机译:ZIPF;堆;法律;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号