【24h】

Permutation Based XML Compression

机译:基于置换的XML压缩

获取原文

摘要

An XML document D often has a regular structure, i.e., it is composed of many similarly named and structured subtrees. Therefore, the entropy of a trees structuredness should be relatively low and thus the trees should be highly compressible by transforming them to an intermediate form. In general, this idea is used in permutation based XML-conscious compressors. An example of such a compressor is called XSAQCT, where the compressible form is called an annotated tree. While XSAQCT proved to be useful for various applications, it was never shown that it is a lossless compressor. This paper provides the formal background for the definition of an annotated tree, and a formal proof that the compression is lossless. It also shows properties of annotated trees that are useful for various applications, and discusses a measure of compressibility using this approach, followed by the experimental results showing compressibility of annotated trees.
机译:XML文档D通常具有规则的结构,即,它由许多相似命名和结构化的子树组成。因此,树木的结构性的熵应该相对较低,因此,通过将树木转化为中间形式,树木应该具有较高的可压缩性。通常,此想法用于基于置换的XML感知压缩器。这种压缩器的一个示例称为XSAQCT,其中可压缩形式称为带注释的树。尽管XSAQCT被证明对各种应用有用,但从未有人证明它是无损压缩机。本文为带注释树的定义提供了形式背景,并提供了压缩无损的形式证明。它还显示了可用于各种应用程序的带注释树的属性,并讨论了使用此方法的可压缩性度量,随后是显示带注释树的可压缩性的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号