...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Content-lossless document image compression based on structural analysis and pattern matching
【24h】

Content-lossless document image compression based on structural analysis and pattern matching

机译:基于结构分析和模式匹配的无损文档图像压缩

获取原文
获取原文并翻译 | 示例
           

摘要

This payer presents a highly efficient content-lossless document image compression scheme. The method consists of three stages. Firstly, the image is analysed and segmented into symbols and position parameters by analysing the relation of the foreground to background and their connectivity. Secondly, the initial representative symbol set from symbols in the image is extracted and matched by direction-based bit-map analysis and matching, and the final representative and synthetic pattern set with less-repeated symbol is formed from the previous symbol set by multi-stage structure clustering and representative pattern deriving and synthesis. This final component set is reorganized into a compact library image. Finally, high ratio compression is achieved by coding relative positions of symbols, parameters of representative patterns and the library image using the adaptive arithmetic coder with different orders and the Q-Coder, respectively. Our scheme achieves much better compression and less error-map than most of alternative systems. Its lossiness can be reduced to a quite small level in a well-defined pattern deriving and synthesis manner compromising compression ratio. Our method can assure content-lossless reconstruction in our symbol-level content-lossless criteria. The method can be easily combined with soft pattern matching to extend to lossless mode. In addition, combining this method with the JBIG1 progressive mode with less-redundancy component library can achieve content-lossless progressive transmission capability. Our method can also be used to deal with various symbolic images including nested symbols like Chinese character images by means of symbolic segmentation based on only connection and position-based bit-map reconstruction. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved. [References: 21]
机译:该付款人提出了一种高效的无内容损失文档图像压缩方案。该方法包括三个阶段。首先,通过分析前景与背景之间的关系及其连通性,对图像进行分析并将其分割为符号和位置参数。其次,通过基于方向的位图分析和匹配,从图像中的符号中提取出初始的代表性符号集并进行匹配,然后,通过将先前的符号集进行多步运算,形成重复次数较少的最终的代表性和合成图案集。阶段结构聚类和代表性模式的推导与综合。最终的组件集被重新组织成一个紧凑的库映像。最后,通过分别使用具有不同阶数的自适应算术编码器和Q-Coder对符号的相对位置,代表图案的参数和库图像进行编码来实现高比率压缩。与大多数替代系统相比,我们的方案可实现更好的压缩和更少的错误映射。它的有损性可以以一种确定的模式导出和合成方式降低到很小的水平,从而损害压缩率。我们的方法可以确保以我们的符号级内容无损标准进行内容无损重构。该方法可以轻松地与软模式匹配结合以扩展到无损模式。此外,将此方法与具有较少冗余组件库的JBIG1渐进模式结合可以实现无内容损失的渐进传输能力。通过仅基于连接和基于位置的位图重构的符号分割,我们的方法还可用于处理包括汉字图像之类的嵌套符号在内的各种符号图像。 (C)2000模式识别学会。由Elsevier Science Ltd.出版。保留所有权利。 [参考:21]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号