...
首页> 外文期刊>IEEE Transactions on Information Theory >The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression
【24h】

The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression

机译:零频问题:估计自适应文本压缩中新事件的概率

获取原文
获取原文并翻译 | 示例

摘要

Approaches to the zero-frequency problem in adaptive text compression are discussed. This problem relates to the estimation of the likelihood of a novel event occurring. Although several methods have been used, their suitability has been on empirical evaluation rather than a well-founded model. The authors propose the application of a Poisson process model of novelty. Its ability to predict novel tokens is evaluated, and it consistently outperforms existing methods. It is applied to a practical statistical coding scheme, where a slight modification is required to avoid divergence. The result is a well-founded zero-frequency model that explains observed differences in the performance of existing methods, and offers a small improvement in the coding efficiency of text compression over the best method previously known.
机译:讨论了自适应文本压缩中零频问题的解决方法。该问题与新事件发生的可能性的估计有关。尽管已经使用了几种方法,但是它们的适用性取决于经验评估,而不是基于充分依据的模型。作者提出了新颖的泊松过程模型的应用。评估了其预测新令牌的能力,并且其性能始终优于现有方法。它被应用到实际的统计编码方案中,其中需要稍作修改以避免差异。结果是一个有充分根据的零频率模型,该模型可以解释观察到的现有方法的性能差异,并且与以前已知的最佳方法相比,文本压缩的编码效率略有提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号