首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >dhSegment: A Generic Deep-Learning Approach for Document Segmentation
【24h】

dhSegment: A Generic Deep-Learning Approach for Document Segmentation

机译:dhSegment:用于文档分割的通用深度学习方法

获取原文

摘要

In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.
机译:近年来,已经进行了多次成功的尝试,这些任务通过设计特定于任务的手动调整策略来分别解决文档处理问题。我们认为,历史文档处理任务的多样性禁止一次解决一个问题,并且表明需要设计通用方法来处理历史序列的可变性。在本文中,我们同时处理多个任务,例如页面提取,基线提取,布局分析或插图和照片提取的多种类型。我们提出了一个基于CNN的像素级预测器的开源实现,并结合了与任务相关的后处理模块。我们表明,单个CNN架构可用于具有竞争性结果的任务。而且,大多数特定于任务的后处理步骤可以分解为少量的简单且标准的可重用操作,从而增加了我们方法的灵活性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号