We report the results of a quantitative analysis of word distributions in two Ukrainian-language fables by Ivan Franko: "Fox Mykyta" and "Abu-Kasym's Slippers". Our study consists of two parts: an analysis of frequency-rank distributions and an application of complex network theory. The analysis of frequency-rank distributions shows that the texts are large enough for their statistical properties to be observed. The power-law character of these distributions (Zipf's law) holds in the rank range $r = 20 \div 3000$ with an exponent $\alpha \simeq 1$. This substantiates the choice of these texts as a basis for analysing typical properties of the language complex network. In addition, the applicability of the Simon model to describing non-asymptotic properties of word distributions is evaluated.
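As an illustration of the frequency-rank analysis summarised in the abstract, the following minimal Python sketch builds a frequency-rank list from a token sequence and estimates a Zipf exponent over a chosen rank window. The function names `rank_frequency` and `zipf_exponent` are hypothetical, and the ordinary least-squares fit in log-log coordinates is an assumption made for illustration; the abstract does not state which fitting procedure the authors used. Tokenisation of the Ukrainian texts is left out.

```python
import math
from collections import Counter


def rank_frequency(tokens):
    """Return word frequencies sorted in descending order (rank 1 first)."""
    return sorted(Counter(tokens).values(), reverse=True)


def zipf_exponent(freqs, r_min=1, r_max=None):
    """Estimate alpha in f(r) ~ r**(-alpha) over ranks r_min..r_max.

    Assumption: a plain least-squares line fit of log f against log r.
    """
    if r_max is None or r_max > len(freqs):
        r_max = len(freqs)
    if r_max - r_min < 1:
        raise ValueError("need at least two ranks in the fitting window")

    ranks = range(r_min, r_max + 1)
    xs = [math.log(r) for r in ranks]
    ys = [math.log(freqs[r - 1]) for r in ranks]

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    # The fitted slope is -alpha, so flip the sign.
    return -sxy / sxx
```

For a real text, one would call `zipf_exponent(rank_frequency(tokens), r_min=20, r_max=3000)` to restrict the fit to the rank window quoted in the abstract; an exponent close to 1 would be consistent with the reported result.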