...
首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >A review on document image analysis techniques directly in the compressed domain
【24h】

A review on document image analysis techniques directly in the compressed domain

机译:直接在压缩域中的文档图像分析技术综述

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The rapid growth of digital libraries, e-governance, and internet based applications has caused an exponential escalation in the volume of Big-data' particularly due to texts, images, audios and videos that are being both archived and transmitted on a daily basis. In order to make their storage and transfer efficient, different data compression techniques are used in the literature. The ultimate motive behind data compression is to transform a big size data into small size data, which eventually implies less space while archiving, and less time in transferring. However, in order to operate/analyze compressed data, it is usually necessary to decompress it, so as to bring back the data to its original form, which unfortunately warrants an additional computing cost. In this backdrop, if operating upon the compressed data itself can be made possible without going through the stage of decompression, then the advantage that could be accomplished due to compression would escalate. Further due to compression, from the data structure and storage perspectives, the original visibility structure of the data also being lost, it turns into a potential challenge to trace the original information in the compressed representation. This challenge is the motivation behind exploring the idea of direct processing on the compressed data itself in the literature. The proposed survey paper specifically focuses on compressed document images and brings out two original contributions. The first contribution is that it presents a critical study on different image analysis and image compression techniques, and highlights the motivational reasons for pursuing document image analysis in the compressed domain. The second contribution is that it summarizes the different compressed domain techniques in the literature so far based on the type of compression and operations performed by them. Overall, the paper aims to provide a perspective for pursuing further research in the area of document image analysis and pattern recognition directly based on the compressed data.
机译:数字图书馆,电子治理和基于互联网的应用程序的快速增长导致了大数据量的指数升级,特别是由于每天存档和传输的文本,图像,录音和视频是由于文本,图像,音频和视频。为了使其存储和转移有效,文献中使用不同的数据压缩技术。数据压缩背后的最终动机是将大尺寸数据转换为小尺寸数据,最终暗示存档时的空间较少,以及较少的传输时间。然而,为了操作/分析压缩数据,通常需要解压缩它,以便将数据带回其原始形式,这不幸的是需要额外的计算成本。在该背景中,如果在压缩数据本身上操作,则可以在不经过减压阶段的情况下进行,然后可以由于压缩而可以实现的优点将升级。进一步由于压缩,从数据结构和存储的角度来看,数据的原始可见性结构也丢失,它变成了追踪压缩表示中的原始信息的潜在挑战。这一挑战是探索文献中压缩数据本身的直接处理思想的动机。拟议的调查纸专注于压缩的文件图像并带来两个原始贡献。第一贡献是它提出了对不同图像分析和图像压缩技术的关键研究,并突出了在压缩域中追求文档图像分析的动机原因。第二贡献是,到目前为止,它基于它们执行的压缩类型和操作的类型来总结了文献中的不同压缩域技术。总体而言,本文旨在提供基于压缩数据直接追求文档图像分析和模式识别领域进一步研究的视角。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号