首页> 外文会议>International conference on image analysis and processing >A Saliency-Based Convolutional Neural Network for Table and Chart Detection in Digitized Documents
【24h】

A Saliency-Based Convolutional Neural Network for Table and Chart Detection in Digitized Documents

机译:基于显着性的卷积神经网络,用于数字文档中的表格和图表检测

获取原文

摘要

Within the realm of information extraction from documents, detection of tables and charts is particularly needed as they contain a visual summary of the most valuable information contained in a document. For a complete automation of the visual information extraction process from tables and charts, it is necessary to develop techniques that localize them and identify precisely their boundaries. In this paper we aim at solving the table/chart detection task through an approach that combines deep convolutional neural networks, graphical models and saliency concepts. In particular, we propose a saliency-based fully-convolutional neural network performing multi-scale reasoning on visual cues followed by a fully-connected conditional random field (CRF) for localizing tables and charts in digital/digitized documents. Performance analysis, carried out on an extended version of the ICDAR 2013 (with annotated charts as well as tables) dataset, shows that our approach yields promising results, outperforming existing models.
机译:在从文档中提取信息的领域中,特别需要对表格和图表进行检测,因为它们包含文档中包含的最有价值信息的可视摘要。为了使表格和图表的视觉信息提取过程完全自动化,有必要开发将它们定位并精确标识其边界的技术。在本文中,我们旨在通过一种结合深度卷积神经网络,图形模型和显着性概念的方法来解决表格/图表检测任务。特别是,我们提出了一个基于显着性的全卷积神经网络,对视觉线索执行多尺度推理,然后是一个完全连接的条件随机字段(CRF),用于在数字/数字化文档中定位表格和图表。对ICDAR 2013的扩展版本(带有带注释的图表和表格)数据集进行的性能分析表明,我们的方法产生了令人鼓舞的结果,优于现有模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号