Conference: IC-MSQUARE

Deviations in the Zipf and Heaps laws in natural languages

Abstract

This paper verifies the empirical Zipf and Heaps laws in natural languages using data from the Google Books Ngram corpus. It discusses the connection between Zipf's law and Heaps' law, the latter of which predicts a power-law dependence of vocabulary size on text size. In practice, the Heaps exponent in this dependence varies as the text corpus grows. To explain this variation, the results are compared with a probabilistic model of text generation. Quasi-periodic variations with characteristic periods of 60-100 years were also found.
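
The relationship between the two laws can be illustrated with a toy experiment. The sketch below is not the paper's model or data: it assumes tokens are drawn independently from an idealized Zipf distribution with a hypothetical exponent `alpha` over a finite vocabulary, then measures vocabulary growth V(N) and the local slope of log V against log N. For an unbounded vocabulary with `alpha` > 1, the commonly cited asymptotic result is a Heaps exponent of about 1/`alpha`; with a finite vocabulary the local slope drifts as the sample grows, which is one simple way a non-constant exponent can arise. All function names and parameter values here are illustrative assumptions.

```python
import math
import random


def sample_zipf_tokens(n_tokens, n_types, alpha, seed=0):
    """Draw token ranks i.i.d. with P(rank r) proportional to r**(-alpha)."""
    rng = random.Random(seed)
    ranks = range(1, n_types + 1)
    weights = [r ** -alpha for r in ranks]
    return rng.choices(ranks, weights=weights, k=n_tokens)


def vocabulary_growth(tokens):
    """Return V(N), the number of distinct tokens among the first N, for N = 1..len(tokens)."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def local_heaps_exponents(growth):
    """Estimate the slope of log V versus log N between successive doublings of N."""
    checkpoints = []
    n = 1
    while n <= len(growth):
        checkpoints.append(n)
        n *= 2
    slopes = []
    for a, b in zip(checkpoints, checkpoints[1:]):
        rise = math.log(growth[b - 1]) - math.log(growth[a - 1])
        run = math.log(b) - math.log(a)
        slopes.append((b, rise / run))
    return slopes


if __name__ == "__main__":
    # Illustrative parameters only; they are not taken from the paper.
    tokens = sample_zipf_tokens(n_tokens=20000, n_types=5000, alpha=1.2, seed=1)
    growth = vocabulary_growth(tokens)
    for n, slope in local_heaps_exponents(growth):
        print(f"N = {n:6d}  V(N) = {growth[n - 1]:5d}  local exponent = {slope:.3f}")
```

Running the script prints the local exponent at each doubling of N. The early values are noisy because they rest on only a few tokens, so the trend is easier to read at larger N.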
