Content-lossless document image compression based on structural analysis and pattern matching

Yang YB.; Yu DG.; Yan H.

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Content-lossless document image compression based on structural analysis and pattern matching

【24h】

Content-lossless document image compression based on structural analysis and pattern matching

机译：基于结构分析和模式匹配的无损文档图像压缩

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This payer presents a highly efficient content-lossless document image compression scheme. The method consists of three stages. Firstly, the image is analysed and segmented into symbols and position parameters by analysing the relation of the foreground to background and their connectivity. Secondly, the initial representative symbol set from symbols in the image is extracted and matched by direction-based bit-map analysis and matching, and the final representative and synthetic pattern set with less-repeated symbol is formed from the previous symbol set by multi-stage structure clustering and representative pattern deriving and synthesis. This final component set is reorganized into a compact library image. Finally, high ratio compression is achieved by coding relative positions of symbols, parameters of representative patterns and the library image using the adaptive arithmetic coder with different orders and the Q-Coder, respectively. Our scheme achieves much better compression and less error-map than most of alternative systems. Its lossiness can be reduced to a quite small level in a well-defined pattern deriving and synthesis manner compromising compression ratio. Our method can assure content-lossless reconstruction in our symbol-level content-lossless criteria. The method can be easily combined with soft pattern matching to extend to lossless mode. In addition, combining this method with the JBIG1 progressive mode with less-redundancy component library can achieve content-lossless progressive transmission capability. Our method can also be used to deal with various symbolic images including nested symbols like Chinese character images by means of symbolic segmentation based on only connection and position-based bit-map reconstruction. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved. [References: 21]

机译：该付款人提出了一种高效的无内容损失文档图像压缩方案。该方法包括三个阶段。首先，通过分析前景与背景之间的关系及其连通性，对图像进行分析并将其分割为符号和位置参数。其次，通过基于方向的位图分析和匹配，从图像中的符号中提取出初始的代表性符号集并进行匹配，然后，通过将先前的符号集进行多步运算，形成重复次数较少的最终的代表性和合成图案集。阶段结构聚类和代表性模式的推导与综合。最终的组件集被重新组织成一个紧凑的库映像。最后，通过分别使用具有不同阶数的自适应算术编码器和Q-Coder对符号的相对位置，代表图案的参数和库图像进行编码来实现高比率压缩。与大多数替代系统相比，我们的方案可实现更好的压缩和更少的错误映射。它的有损性可以以一种确定的模式导出和合成方式降低到很小的水平，从而损害压缩率。我们的方法可以确保以我们的符号级内容无损标准进行内容无损重构。该方法可以轻松地与软模式匹配结合以扩展到无损模式。此外，将此方法与具有较少冗余组件库的JBIG1渐进模式结合可以实现无内容损失的渐进传输能力。通过仅基于连接和基于位置的位图重构的符号分割，我们的方法还可用于处理包括汉字图像之类的嵌套符号在内的各种符号图像。（C）2000模式识别学会。由Elsevier Science Ltd.出版。保留所有权利。 [参考：21]

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2000年第8期|共17页
作者
Yang YB.; Yu DG.; Yan H.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术及设备;
关键词
Document image analysis; Structural clustering; Pattern matching; Image compression;

机译：文档图像分析;结构聚类;模式匹配;图像压缩;

相似文献

外文文献
中文文献
专利

1. Content-lossless document image compression based on structural analysis and pattern matching [J] . Yang YB., Yu DG., Yan H. Pattern Recognition: The Journal of the Pattern Recognition Society . 2000,第8期

机译：基于结构分析和模式匹配的无损文档图像压缩
2. Compression of Chinese Document Images by Complex Shape Matching [J] . Chwan-Yi Shiah, Yun-Sheng Yen The Computer journal . 2013,第11期

机译：基于复杂形状匹配的中文文档图像压缩
3. A collaborative enhancement-compression approach for historical document images based on PDE-analysis [J] . Yagoubi Mohamed Riad, Serir Amina, Beghdadi Azeddine Digital Signal Processing . 2017,第期

机译：基于PDE分析的历史文献图像的协作增强方法
4. Symbolic Document Image Compression Based on Pattern Matching Techniques [C] . Chwan-Yi Shiah, Yun-Sheng Yen International conference on graphic and image processing;ICGIP 2011 . 2012

机译：基于模式匹配技术的符号文档图像压缩
5. Text image compression based on pattern matching. [D] . Ye, Yan. 2002

机译：基于模式匹配的文本图像压缩。
6. Impact of health insurance on healthcare utilisation patterns in Vietnam: a survey-based analysis with propensity score matching method [O] . Nguyen Thi Thu Thuong 2020

机译：健康保险对越南医疗利用模式的影响：基于调查的分析与倾向分数匹配方法
7. Compression of Chinese document images based on morphologic analysis and pattern matching [O] . Chen Shu, Yan Hong, 徐至展 2006

机译：基于形态分析和模式匹配的中文文档图像压缩
8. Information System for co-Registration of CAESAR Sea-Images Based on TwoDimensional Image Matching Techniques (SCORE). Part 3: Design and Detailed Design Document [R] . Rietveld, A. J., Habets, G. M. H. J., Looijen, W. J. 1992

机译：基于二维图像匹配技术（sCORE）的CaEsaR海图共同配准信息系统。第3部分：设计和详细设计文件

Content-lossless document image compression based on structural analysis and pattern matching

摘要

著录项

相似文献

相关主题

期刊订阅