dhSegment: A Generic Deep-Learning Approach for Document Segmentation

机译：dhSegment：用于文档分割的通用深度学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.

机译：近年来，已经进行了多次成功的尝试，这些任务通过设计特定于任务的手动调整策略来分别解决文档处理问题。我们认为，历史文档处理任务的多样性禁止一次解决一个问题，并且表明需要设计通用方法来处理历史序列的可变性。在本文中，我们同时处理多个任务，例如页面提取，基线提取，布局分析或插图和照片提取的多种类型。我们提出了一个基于CNN的像素级预测器的开源实现，并结合了与任务相关的后处理模块。我们表明，单个CNN架构可用于具有竞争性结果的任务。而且，大多数特定于任务的后处理步骤可以分解为少量的简单且标准的可重用操作，从而增加了我们方法的灵活性。

著录项

来源
《International Conference on Frontiers in Handwriting Recognition》|2018年|7-12|共6页
会议地点
作者
Sofia Ares Oliveira; Benoit Seguin; Frederic Kaplan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Task analysis; Training; Text analysis; Neural networks; Layout; Standards; Image segmentation;

机译：任务分析;培训;文本分析;神经网络;布局;标准;图像分割;
入库时间 2022-08-26 13:51:13

相似文献

外文文献
中文文献
专利

1. Automatic segmentation of TBM muck images via a deep-learning approach to estimate the size and shape of rock chips [J] . Zhou Xiaoxiong, Gong Qiuming, Liu Yongqiang, Automation in construction . 2021,第Juna期

机译：通过深度学习方法自动分割TBM MOCK图像来估计岩石芯片的尺寸和形状
2. Segmentation-based deep-learning approach for surface-defect detection [J] . Tabernik Domen, Sela Samo, Skvarc Jure, Journal of Intelligent Manufacturing . 2020,第3期

机译：基于分段的表面缺陷检测的深学习方法
3. A Human Behavior-Driven Deep-Learning Approach for Automatic Sigmoid Segmentation [J] . Gonzalez Y., Shen C., Jung H., International Journal of Radiation Oncology, Biology, Physics . 2019,第1Suppla期

机译：一种用于自动乙状结构分割的人类行为驱动的深度学习方法
4. dhSegment: A Generic Deep-Learning Approach for Document Segmentation [C] . Sofia Ares Oliveira, Benoit Seguin, Frederic Kaplan International Conference on Frontiers in Handwriting Recognition . 2018

机译：DHSEMING：文档分割的通用深度学习方法
5. Towards generic region segmentation for image/video analysis: An integrated perceptual grouping approach using Generic-Edge-Token-graph. [D] . Chen, HuiQiong. 2009

机译：走向用于图像/视频分析的通用区域分割：使用Generic-Edge-Token-graph的集成感知分组方法。
6. Dual-energy CT for automatic organs-at-risk segmentation in brain-tumor patients using a multi-atlas and deep-learning approach [O] . Brent van der Heyden, Patrick Wohlfahrt, Daniëlle B. P. Eekers, -1

机译：使用多图谱和深度学习方法的双能CT对脑肿瘤患者的危险器官自动分割
7. A Generic Framework for Information Segmentation in Document Images: A part-based Approach [O] . Ahmed Sheraz 2016

机译：文档图像中信息分割的通用框架：基于部分的方法

dhSegment: A Generic Deep-Learning Approach for Document Segmentation

摘要

著录项

相似文献

相关主题

期刊订阅