首页> 外文会议>International joint conference on natural language processing >Visual Detection with Context for Document Layout Analysis
【24h】

Visual Detection with Context for Document Layout Analysis

机译:具有文档布局分析的上下文的视觉检测

获取原文

摘要

We present 1) a work in progress method to visually segment key regions of scientific articles using an object detection technique augmented with contextual features, and 2) a novel dataset of region-labeled articles. A continuing challenge in scientific literature mining is the difficulty of consistently extracting high-quality text from formatted PDFs. To address this, we adapt the object-detection technique Faster R-CNN for document layout detection, incorporating contextual information that leverages the inherently localized nature of article contents to improve the region detection performance. Due to the limited availability of high-quality region-labels for scientific articles, we also contribute a novel dataset of region annotations, the first version of which covers 9 region classes and 822 article pages. Initial experimental results demonstrate a 23.9% absolute improvement in mean average precision over the baseline model by incorporating contextual features, and a processing speed 14x faster than a text-based technique. Ongoing work on further improvements is also discussed.
机译:我们展示了1)在通过上下文特征增强的物体检测技术的视觉方法中的进步方法中的工作方法,以及2)区域标记物品的新型数据集。科学文学挖掘的持续挑战是难以一致地从格式化的PDF中提取高质量文本。为了解决此问题,我们适应对象检测技术,用于更快的R-CNN进行文档布局检测,包括利用物品内容的固有局部化性质来改善区域检测性能的上下文信息。由于科学文章的高质量区域标签的可用性有限,我们还贡献了一个新的区域注释数据集,其中第一版本涵盖了9个区域类和822条。初始实验结果通过结合上下文特征,在基线模型中表现出平均平均精度的23.9%绝对改善,以及比基于文本的技术更快的处理速度14X。还讨论了进一步改进的持续工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号