首页> 外文会议>International Conference on Document Analysis and Recognition >Deep Splitting and Merging for Table Structure Decomposition
【24h】

Deep Splitting and Merging for Table Structure Decomposition

机译:表结构分解的深度拆分和合并

获取原文

摘要

Given the large variety and complexity of tables, table structure extraction is a challenging task in automated document analysis systems. We present a pair of novel deep learning models (Split and Merge models) that given an input image, 1) predicts the basic table grid pattern and 2) predicts which grid elements should be merged to recover cells that span multiple rows or columns. We propose projection pooling as a novel component of the Split model and grid pooling as a novel part of the Merge model. While most Fully Convolutional Networks rely on local evidence, these unique pooling regions allow our models to take advantage of the global table structure. We achieve state-of-the-art performance on the public ICDAR 2013 Table Competition dataset of PDF documents. On a much larger private dataset which we used to train the models, we significantly outperform both a state-ofthe-art deep model and a major commercial software system.
机译:鉴于表的多样性和复杂性,在自动文档分析系统中,表结构的提取是一项艰巨的任务。我们提供了一对新颖的深度学习模型(拆分和合并模型),这些模型给定了输入图像,1)预测了基本的表格网格模式,2)预测了应该合并哪些网格元素以恢复跨越多行或多列的单元格。我们建议将投影池作为Split模型的一个新颖组件,将网格池作为Merge模型的一个新颖部分。尽管大多数完全卷积网络都依赖于本地证据,但这些独特的合并区域使我们的模型能够利用全局表结构。我们在公开的ICDAR 2013表格竞赛数据集PDF文档上实现了最先进的性能。在我们用来训练模型的更大的私有数据集上,我们的性能大大超过了最新的深度模型和主要的商业软件系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号