首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images
【24h】

DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images

机译:Deepdesrt:深入学习文献图像中表格的检测和结构识别

获取原文

摘要

This paper presents a novel end-to-end system for table understanding in document images called DeepDeSRT. In particular, the contribution of DeepDeSRT is two-fold. First, it presents a deep learning-based solution for table detection in document images. Secondly, it proposes a novel deep learning-based approach for table structure recognition, i.e. identifying rows, columns, and cell positions in the detected tables. In contrast to existing rule-based methods, which rely on heuristics or additional PDF metadata (like, for example, print instructions, character bounding boxes, or line segments), the presented system is data-driven and does not need any heuristics or metadata to detect as well as to recognize tabular structures in document images. Furthermore, in contrast to most existing table detection and structure recognition methods, which are applicable only to PDFs, DeepDeSRT processes document images, which makes it equally suitable for born-digital PDFs (as they can automatically be converted into images) as well as even harder problems, e.g. scanned documents. To gauge the performance of DeepDeSRT, the system is evaluated on the publicly available ICDAR 2013 table competition dataset containing 67 documents with 238 pages overall. Evaluation results reveal that DeepDeSRT outperforms state-of-the-art methods for table detection and structure recognition and achieves F1-measures of 96.77% and 91.44% for table detection and structure recognition, respectively. Additionally, DeepDeSRT is evaluated on a closed dataset from a real use case of a major European aviation company comprising documents which are highly unlike those in ICDAR 2013. Tested on a randomly selected sample from this dataset, DeepDeSRT achieves high detection accuracy for tables which demonstrates the sound generalization capabilities of our system.
机译:本文介绍了一种新的端到端系统,用于叫做DeepDesrt的文档图像中的表格理解。特别是,Deepdesrt的贡献是两倍。首先,它提出了一种基于深入的学习的解决方案,用于文档图像中的表检测。其次,提出了一种基于深度学习的表结构识别方法,即识别检测到的表中的行,列和小区位置。与依赖于启发式的基于规则的方法或额外的PDF元数据(例如,打印指令,字符边界框或行段),所呈现的系统是数据驱动的,不需要任何启发式或元数据要检测以及识别文档图像中的表格结构。此外,与大多数现有的表检测和结构识别方法相比,这仅适用于PDFS,DeepDESRT处理文档图像,这使得它同样适用于出生的数字PDF(因为它们可以自动转换为图像)以及甚至更难的问题,例如扫描文件。为了衡量DeepDesrt的性能,系统在公开可用的ICDAR 2013表竞赛数据集上进行了评估,其中包含67个文件,总体上有238页。评价结果表明,Deepdesrt优于表检测和结构识别的最先进方法,并分别实现了表检测和结构识别的96.77 %和91.44 %的F1测量。此外,DeepDeSRT对从一个主要的欧洲航空公司,包括其为高度不同于那些在2013年ICDAR测试来自该数据集随机选择的示例文档的实际使用情况下的密闭数据集进行评估,DeepDeSRT实现为表检测精度高这表明我们系统的声音泛化能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号