【24h】

PREDICTION BY COMPRESSION

机译:压缩预测

获取原文

摘要

It is well known that text compression can be achieved by predicting the next symbol in the stream of text data based on the history seen up to the current symbol. The better the prediction the more skewed the conditional probability distribution of the next symbol and the shorter the codeword that needs to be assigned to represent this next symbol. What about the opposite direction ? suppose we have a black box that can compress text stream. Can it be used to predict the next symbol in the stream ? We introduce a novel criterion based on the length of the compressed data and use it to predict the next symbol. We examine empirically the prediction error rate and its dependency on some compression parameters.
机译:众所周知,可以通过基于由当前符号看到的历史预测文本数据流中的下一个符号来实现文本压缩。预测越好,下一个符号的条件概率分布越越好,需要分配的较短码字以表示下一个符号。对相反方向呢?假设我们有一个可以压缩文本流的黑匣子。它可以用来预测流中的下一个符号吗?我们基于压缩数据的长度引入新的标准,并使用它来预测下一个符号。我们对一些压缩参数的预测误差率及其依赖性来检查预测误差率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号