【24h】

Visual Detection with Context for Document Layout Analysis

机译:具有上下文的视觉检测,用于文档布局分析

获取原文

摘要

We present 1) a work in progress method to visually segment key regions of scientific articles using an object detection technique augmented with contextual features, and 2) a novel dataset of region-labeled articles. A continuing challenge in scientific literature mining is the difficulty of consistently extracting high-quality text from formatted PDFs. To address this, we adapt the object-detection technique Faster R-CNN for document layout detection, incorporating contextual information that leverages the inherently localized nature of article contents to improve the region detection performance. Due to the limited availability of high-quality region-labels for scientific articles, we also contribute a novel dataset of region annotations, the first version of which covers 9 region classes and 822 article pages. Initial experimental results demonstrate a 23.9% absolute improvement in mean average precision over the baseline model by incorporating contextual features, and a processing speed 14x faster than a text-based technique. Ongoing work on further improvements is also discussed.
机译:我们提出1)一种进行中的方法,使用带有上下文特征的对象检测技术对科学文章的关键区域进行可视化细分,以及2)带有区域标签的文章的新数据集。科学文献挖掘中的一个持续挑战是从格式化的PDF持续提取高质量文本的困难。为了解决这个问题,我们将对象检测技术Faster R-CNN应用于文档布局检测,并结合上下文信息,该上下文信息利用文章内容的固有局部性来改善区域检测性能。由于用于科学文章的高质量区域标签的可用性有限,我们还贡献了一个新颖的区域注释数据集,其第一个版本涵盖9个区域类别和822个文章页面。初步的实验结果表明,通过结合上下文特征,平均平均精度比基线模型提高了23.9%,处理速度比基于文本的技术快14倍。还讨论了正在进行的进一步改进工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号