首页> 外文会议>IEEE International Symposium on Information Theory >Complexity and Similarity for Sequences using LZ77-based conditional information measure
【24h】

Complexity and Similarity for Sequences using LZ77-based conditional information measure

机译:使用基于LZ77的条件信息量度的序列复杂度和相似度

获取原文

摘要

This work concerns the definition of conditional mutual information in the framework of Algorithmic Information Theory (AIT), which is of use when no probabilistic model of the data is available, or hard to devise. We introduce a practical way to construct a conditional mutual information quantity which respects the chain rule and the data processing inequalityThe proposed implementation, named SALZA, allows to accomplish various information-theoretic tasks on sequences. The algorithmic model of the data used in this work is that of the well-known Lempel-Ziv primitive: we assume new data is to be expressed in terms of references to prior data.SALZA enables a flexible specification of prior data and extracts information quantities based on the significance of the references to these prior data. The tool readily implements the computation of an information measure based on LZ77 and a universal classifier based on the Ziv-Merhav relative coder for the universal clustering of sequences.Illustration of the proposed implementation is provided on clustering and causality inference examples.
机译:这项工作涉及在算法信息论(AIT)框架中对条件互信息的定义,当没有可用的数据概率模型或难以设计时,可以使用该条件。我们介绍了一种构造有条件的互信息量的实用方法,该条件互信息量要考虑链规则和数据处理不平等性。所提出的名为SALZA的实现可以完成序列上的各种信息理论任务。这项工作中使用的数据的算法模型是著名的Lempel-Ziv原语的模型:我们假设要根据对先前数据的引用来表达新数据.SALZA可以灵活地规范先前数据并提取信息量基于对这些现有数据的参考意义。该工具可以轻松实现基于LZ77的信息度量的计算以及基于Ziv-Merhav相对编码器的通用分类器的通用序列聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号