Conference: International Conference on Control, Automation and Artificial Intelligence

Statistics and Analysis of Mongolian Syllables Based on Network Corpus


Abstract

This article builds a large-scale Mongolian text corpus from CCTV and other news websites and carries out a statistical analysis of the Mongolian syllables in that corpus. Using an n-gram model, the analysis shows how likely different Mongolian syllables are to co-occur. The data also point to four main causes of Mongolian misspellings: (1) monosyllabic errors, (2) misuse of spaces, (3) improper use of control characters, and (4) polyphonic words that share the same written form.
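The abstract does not say how the n-gram statistics were computed. As a minimal sketch of the general idea, the following estimates bigram co-occurrence probabilities by maximum likelihood from words that are assumed to be already split into syllables. The function name `bigram_probabilities` and the Latin-transliterated toy data are hypothetical and do not come from the paper.

```python
from collections import Counter


def bigram_probabilities(words):
    """Estimate P(next syllable | current syllable) by maximum likelihood.

    `words` is an iterable of syllable lists, one list per word.
    Syllable segmentation is assumed to have been done beforehand.
    """
    context_counts = Counter()  # times a syllable appears as the first of a pair
    bigram_counts = Counter()   # times a specific (current, next) pair appears
    for syllables in words:
        for current, nxt in zip(syllables, syllables[1:]):
            context_counts[current] += 1
            bigram_counts[(current, nxt)] += 1
    return {
        pair: count / context_counts[pair[0]]
        for pair, count in bigram_counts.items()
    }


# Hypothetical toy corpus (illustrative transliterations, not real data).
toy_words = [["mon", "gol"], ["mon", "gol"], ["mon", "go"], ["sa", "in"]]
probs = bigram_probabilities(toy_words)
```

Estimates of this kind could, for instance, be used to flag rarely seen syllable pairs for manual review, although the abstract does not describe such a procedure.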
