【24h】

Interpreting Data from Scanned Tables

机译:解释扫描表中的数据

获取原文

摘要

Densely-packed but structured scientific data are typically presented in the form of tables, which often appear in raster image form. To interpret data from scanned tables, understanding their hierarchical structure is vital. To further address the vast variability of table representations, we propose a fully automatic methodology that uses a bottom-up reasoning that is independent on the existence of representation features, such as lines. We evaluate our approach on the ICDAR 2013 dataset and demonstrate its effectiveness on detecting tables cells and their content and classifying header and data cells. For detecting the cell hierarchy, we demonstrate results on synthetic data due to lack of ground truth.
机译:密集包装但结构化的科学数据通常以表格的形式呈现,这些表格通常以光栅图像的形式出现。要解释来自扫描表的数据,了解它们的层次结构至关重要。为了进一步解决表表示形式的巨大差异,我们提出了一种全自动方法,该方法使用自下而上的推理,该推理与表示特征(例如线)的存在无关。我们在ICDAR 2013数据集上评估了我们的方法,并展示了其在检测表单元及其内容以及对标题和数据单元进行分类方面的有效性。为了检测细胞层次,由于缺乏地面真实性,我们在合成数据上显示了结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号