首页> 外文会议>International Conference on Digital Information and Communication Technology and its Applications >Entropy rate of Thai text and testing author authenticity using character combination distribution
【24h】

Entropy rate of Thai text and testing author authenticity using character combination distribution

机译:泰语文本的熵率和使用字符组合分布测试作者真实性

获取原文

摘要

This paper has two main goals. The first goal is to estimate the entropy rate of Thai text which is found to be roughly 2 bits/character. The second goal is to come up with methods for text authentication based on probability distribution and information theoretic quantities. Using proposed methods, we found that digital books composed by the same author give close numerical values, while those from different authors give much higher differences. Among the three techniques under consideration, we found that the entropy-based method provides the best test. Thirty Thai text sources of various styles are tested to increase reliability of the study. Additionally, the comparison of the effectiveness of proposed methods is shown here.
机译:本文有两个主要目标。 第一个目标是估计发现大约2位/字符的泰语文本的熵率。 第二个目标是基于概率分布和信息理论量来提出文本认证的方法。 使用所提出的方法,我们发现由同一作者组成的数字图书提供了近似数值,而来自不同作者的人则提供更高的差异。 在所考虑的三种技术中,我们发现基于熵的方法提供了最佳测试。 测试各种风格的30个泰国文本来源,以提高研究的可靠性。 另外,这里显示了所提出的方法的有效性的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号