DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images

机译：Deepdesrt：深入学习文献图像中表格的检测和结构识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel end-to-end system for table understanding in document images called DeepDeSRT. In particular, the contribution of DeepDeSRT is two-fold. First, it presents a deep learning-based solution for table detection in document images. Secondly, it proposes a novel deep learning-based approach for table structure recognition, i.e. identifying rows, columns, and cell positions in the detected tables. In contrast to existing rule-based methods, which rely on heuristics or additional PDF metadata (like, for example, print instructions, character bounding boxes, or line segments), the presented system is data-driven and does not need any heuristics or metadata to detect as well as to recognize tabular structures in document images. Furthermore, in contrast to most existing table detection and structure recognition methods, which are applicable only to PDFs, DeepDeSRT processes document images, which makes it equally suitable for born-digital PDFs (as they can automatically be converted into images) as well as even harder problems, e.g. scanned documents. To gauge the performance of DeepDeSRT, the system is evaluated on the publicly available ICDAR 2013 table competition dataset containing 67 documents with 238 pages overall. Evaluation results reveal that DeepDeSRT outperforms state-of-the-art methods for table detection and structure recognition and achieves F1-measures of 96.77% and 91.44% for table detection and structure recognition, respectively. Additionally, DeepDeSRT is evaluated on a closed dataset from a real use case of a major European aviation company comprising documents which are highly unlike those in ICDAR 2013. Tested on a randomly selected sample from this dataset, DeepDeSRT achieves high detection accuracy for tables which demonstrates the sound generalization capabilities of our system.

机译：本文介绍了一种新的端到端系统，用于叫做DeepDesrt的文档图像中的表格理解。特别是，Deepdesrt的贡献是两倍。首先，它提出了一种基于深入的学习的解决方案，用于文档图像中的表检测。其次，提出了一种基于深度学习的表结构识别方法，即识别检测到的表中的行，列和小区位置。与依赖于启发式的基于规则的方法或额外的PDF元数据（例如，打印指令，字符边界框或行段），所呈现的系统是数据驱动的，不需要任何启发式或元数据要检测以及识别文档图像中的表格结构。此外，与大多数现有的表检测和结构识别方法相比，这仅适用于PDFS，DeepDESRT处理文档图像，这使得它同样适用于出生的数字PDF（因为它们可以自动转换为图像）以及甚至更难的问题，例如扫描文件。为了衡量DeepDesrt的性能，系统在公开可用的ICDAR 2013表竞赛数据集上进行了评估，其中包含67个文件，总体上有238页。评价结果表明，Deepdesrt优于表检测和结构识别的最先进方法，并分别实现了表检测和结构识别的96.77 ％和91.44 ％的F1测量。此外，DeepDeSRT对从一个主要的欧洲航空公司，包括其为高度不同于那些在2013年ICDAR测试来自该数据集随机选择的示例文档的实际使用情况下的密闭数据集进行评估，DeepDeSRT实现为表检测精度高这表明我们系统的声音泛化能力。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|732p|共6页
会议地点
作者
Sebastian Schreiber; Stefan Agne; Ivo Wolf; Andreas Dengel; Sheraz Ahmed;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Portable document format; Image recognition; Machine learning; Training; Metadata; Task analysis; Feature extraction;

机译：便携式文件格式;图像识别;机器学习;培训;元数据;任务分析;特征提取;

相似文献

外文文献
中文文献
专利

1. Modeling train timetables as images: A cost-sensitive deep learning framework for delay propagation pattern recognition [J] . Huang Ping, Li Zhongcan, Wen Chao, Expert systems with applications . 2021,第Sepa期

机译：建模火车时间表作为图像：延迟传播模式识别的成本敏感的深度学习框架
2. Deep learning for automated forgery detection in hyperspectral document images [J] . Khan Muhammad Jaleed, Yousaf Adeel, Abbas Asad, Journal of electronic imaging . 2018,第PTa2期

机译：深度学习用于高光谱文档图像中的自动伪造检测
3. Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach [J] . Xue Wenyuan, Li Qingyong, Xue Qiyuan Quality Control, Transactions . 2020,第期

机译：深度学习方法医学实验室报告图像图像的文本检测与识别
4. DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images [C] . Sebastian Schreiber, Stefan Agne, Ivo Wolf, IAPR International Conference on Document Analysis and Recognition . 2017

机译：DeepDeSRT：深度学习，用于文档图像中表格的检测和结构识别
5. A Face Detection and Recognition System for Color Images Using Neural Networks with Boosting and Deep Learning [D] . Hajiarbabi, Mohammadreza. 2017

机译：基于神经网络的Boosting和深度学习彩色图像人脸检测与识别系统
6. COVID-19 detection using deep learning models to exploit Social Mimic Optimization and structured chest X-ray images using fuzzy color and stacking approaches [O] . Mesut Toğaçar, Burhan Ergen, Zafer Cömert -1

机译：使用深度学习模型进行COVID-19检测以利用模糊的颜色和堆叠方法利用社交模仿优化和结构化的胸部X射线图像
7. celldeath: A tool for detection of cell death in transmitted light microscopy images by deep learning-based visual recognition [O] . Alejandro Damián La Greca, Nelba Pérez, Sheila Castañeda, 2021

机译：CellDeath：通过基于深度学习的视觉识别检测透射光学显微镜图像中细胞死亡的工具

DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images

摘要

著录项

相似文献

相关主题

期刊订阅