【24h】

Multiscale document segmentation

机译:多尺度文件分割

获取原文

摘要

In this paper, we propose a new approach to docuemtn segmentation which explotis both local texture characteristics and image structure to segment scanned documents into regions such as text, background, headings and images. Our method is based on the use of a multiscal Bayesian framework. This framework is chosen because it allows accurate modeling of both the image characteristics and contextual structure of each region. The parameters which describe the characteristics of typical images are extracted from a database of training images which are produced by scanning typical documents nd hand segmenting them into the desired components. This training procedure is based on the expectation maximization (EM) algorithm and results in approximate maximum likelihood (ML) estimates of the model parameters for region textures and contextual structure at various resolutions. Once the training procedure is performed, scanned documents may be segmented using a fine-to-coarse-to-fine procedure that is computationally efficient.
机译:在本文中,我们提出了一种新的DOCUEMTN分段方法,该方法将局部纹理特征和图像结构分析到将扫描文档进行扫描到文本,背景,标题和图像等地区。我们的方法是基于使用多念观贝叶斯框架的使用。选择该框架,因为它允许精确建模对每个区域的图像特征和上下文结构。描述了描述典型图像特征的参数从训练图像的数据库中提取,该训练图像通过将典型的文档ND手段分割成期望的组件来产生。该培训过程基于期​​望最大化(EM)算法,并导致区域纹理和上下文结构的近似最大似然(ML)估计在各种分辨率下。一旦执行训练过程,就可以使用正在计算效率的细小粗细的过程对扫描的文档进行分段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号