Visual Detection with Context for Document Layout Analysis

机译：具有上下文的视觉检测，用于文档布局分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present 1) a work in progress method to visually segment key regions of scientific articles using an object detection technique augmented with contextual features, and 2) a novel dataset of region-labeled articles. A continuing challenge in scientific literature mining is the difficulty of consistently extracting high-quality text from formatted PDFs. To address this, we adapt the object-detection technique Faster R-CNN for document layout detection, incorporating contextual information that leverages the inherently localized nature of article contents to improve the region detection performance. Due to the limited availability of high-quality region-labels for scientific articles, we also contribute a novel dataset of region annotations, the first version of which covers 9 region classes and 822 article pages. Initial experimental results demonstrate a 23.9% absolute improvement in mean average precision over the baseline model by incorporating contextual features, and a processing speed 14x faster than a text-based technique. Ongoing work on further improvements is also discussed.

机译：我们提出1）一种进行中的方法，使用带有上下文特征的对象检测技术对科学文章的关键区域进行可视化细分，以及2）带有区域标签的文章的新数据集。科学文献挖掘中的一个持续挑战是从格式化的PDF持续提取高质量文本的困难。为了解决这个问题，我们将对象检测技术Faster R-CNN应用于文档布局检测，并结合上下文信息，该上下文信息利用文章内容的固有局部性来改善区域检测性能。由于用于科学文章的高质量区域标签的可用性有限，我们还贡献了一个新颖的区域注释数据集，其第一个版本涵盖9个区域类别和822个文章页面。初步的实验结果表明，通过结合上下文特征，平均平均精度比基线模型提高了23.9％，处理速度比基于文本的技术快14倍。还讨论了正在进行的进一步改进工作。

著录项

来源
《International joint conference on natural language processing;Conference on empirical methods in natural language processing》|2019年|3462-3468|共7页
会议地点
作者
Carlos X. Soto; Shinjae Yoo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Visual Similarity Based Document Layout Analysis [J] . Di Wen, Xiao-Qing Ding Journal of Computer Science & Technology . 2006,第3期

机译：基于视觉相似度的文档布局分析
2. Visual Similarity Based Document Layout Analysis [J] . Di Wen, Xiao-Qing Ding 计算机科学技术学报（英文版） . 2006,第003期

机译：基于视觉相似度的文档布局分析
3. Text/Image Region Separation for Document Layout Detection of Old Document Images Using Non-linear Diffusion and Level Set [J] . S. Sachin Kumar, Parvathy Rajendran, P. Prabaharan, Procedia Computer Science . 2016,第1期

机译：文本/图像区域分离，用于使用非线性扩散和水平集的旧文档图像的文档布局检测
4. Visual Detection with Context for Document Layout Analysis [C] . Carlos X. Soto, Shinjae Yoo International joint conference on natural language processing . 2019

机译：具有文档布局分析的上下文的视觉检测
5. Network Visualization Literacy: Task, Context, and Layout [D] . Zoss, Angela Marie. 2018

机译：网络可视化素养：任务，上下文和布局
6. Clinical Documents: Attribute-Values Entity Representation Context Page Layout And Communication [O] . Christian Lovis, Alexander Lamb, Robert Baud, 2003

机译：临床文档：属性-值实体表示上下文页面布局和交流
7. Visual Detection with Context for Document Layout Analysis [O] . Carlos Soto, Shinjae Yoo 2019

机译：具有文档布局分析的上下文的视觉检测

Visual Detection with Context for Document Layout Analysis

摘要

著录项

相似文献

相关主题

期刊订阅