This paper aims to verify Zipf's law for the Korean language. First, the frequency distribution of two linguistic units, words and letters, is investigated on a large-scale Korean text corpus. Then the least squares method is used to fit the rank-frequency distribution curve of words in Korean text. Finally, the estimated values of the Zipf's law parameter are calculated. The experimental results show that, for both linguistic units, the relationship between frequency and rank approximately follows Zipf's law. This confirms the applicability of Zipf's law to Korean, which is of practical significance for Korean information processing and research.
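The fitting step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the common form of Zipf's law, f(r) = C / r^alpha, and assumes the least squares fit is done on the log-log transformed data, which the abstract does not state. The function names `fit_zipf` and `unit_frequencies` are hypothetical.

```python
from collections import Counter

import numpy as np


def unit_frequencies(units):
    """Count how often each linguistic unit (word or letter) occurs."""
    return list(Counter(units).values())


def fit_zipf(frequencies):
    """Estimate (alpha, C) in f(r) = C / r**alpha by ordinary least squares.

    Assumption: the fit is linear in log-log space,
    log f = log C - alpha * log r.
    """
    # Rank 1 is the most frequent unit, so sort in descending order.
    freqs = np.sort(np.asarray(frequencies, dtype=float))[::-1]
    ranks = np.arange(1, len(freqs) + 1)
    # A degree-1 polynomial fit returns (slope, intercept).
    slope, intercept = np.polyfit(np.log(ranks), np.log(freqs), 1)
    return -slope, float(np.exp(intercept))
```

In use, `unit_frequencies` would be applied to the tokenized corpus (words) or to its decomposed letters, and the result passed to `fit_zipf`. An estimated `alpha` close to 1 is what is usually read as agreement with Zipf's law.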