【24h】

Table Detection Using Deep Learning

机译:表检测使用深度学习

获取原文

摘要

Table detection is a crucial step in many document analysis applications as tables are used for presenting essential information to the reader in a structured manner. It is a hard problem due to varying layouts and encodings of the tables. Researchers have proposed numerous techniques for table detection based on layout analysis of documents. Most of these techniques fail to generalize because they rely on hand engineered features which are not robust to layout variations. In this paper, we have presented a deep learning based method for table detection. In the proposed method, document images are first pre-processed. These images are then fed to a Region Proposal Network followed by a fully connected neural network for table detection. The proposed method works with high precision on document images with varying layouts that include documents, research papers, and magazines. We have done our evaluations on publicly available UNLV dataset where it beats Tesseract's state of the art table detection system by a significant margin.
机译:表检测是许多文档分析应用中的重要步骤,因为表格用于以结构化方式向读者呈现到读者的基本信息。由于桌子的不同布局和编码,这是一个难题。基于文档的布局分析,研究人员提出了许多表检测技术。这些技术中的大多数都无法泛化,因为它们依靠手动工程特征,这些功能并不稳健地布局变化。在本文中,我们介绍了一种基于深度学习的表检测方法。在所提出的方法中,首先预处理文档图像。然后将这些图像馈送到区域提议网络,然后是一个完全连接的神经网络,用于表检测。该方法在文档图像上具有高精度,具有不同的布局,包括文档,研究论文和杂志。我们已经在公开可用的UNLV数据集中进行了评估,其中它通过显着的边缘击败了艺术表检测系统的TESSERACT状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号