Visual Detection with Context for Document Layout Analysis



Abstract

We present 1) a work-in-progress method to visually segment key regions of scientific articles using an object detection technique augmented with contextual features, and 2) a novel dataset of region-labeled articles. A continuing challenge in scientific literature mining is the difficulty of consistently extracting high-quality text from formatted PDFs. To address this, we adapt the object detection technique Faster R-CNN for document layout detection, incorporating contextual information that leverages the inherently localized nature of article contents to improve region detection performance. Due to the limited availability of high-quality region labels for scientific articles, we also contribute a novel dataset of region annotations, the first version of which covers 9 region classes and 822 article pages. Initial experimental results demonstrate a 23.9% absolute improvement in mean average precision over the baseline model by incorporating contextual features, and a processing speed 14x faster than a text-based technique. Ongoing work on further improvements is also discussed.
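The abstract does not say which contextual features are used, so the following is only a minimal sketch of one plausible reading: appending a region's page-relative position and size to its visual feature vector, on the assumption that region types tend to occupy characteristic places on a page. The names `position_context` and `with_context`, the four chosen features, and the page dimensions are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def position_context(box: Box, page_width: float, page_height: float) -> List[float]:
    """Return the page-relative center and size of a box, each value in [0, 1]."""
    x_min, y_min, x_max, y_max = box
    center_x = (x_min + x_max) / 2.0 / page_width
    center_y = (y_min + y_max) / 2.0 / page_height
    rel_width = (x_max - x_min) / page_width
    rel_height = (y_max - y_min) / page_height
    return [center_x, center_y, rel_width, rel_height]


def with_context(visual_features: Sequence[float], box: Box,
                 page_width: float, page_height: float) -> List[float]:
    """Concatenate a region's visual features with its position context."""
    return list(visual_features) + position_context(box, page_width, page_height)


# A title-like box near the top of a hypothetical 600 x 800 pixel page.
# The two-element visual feature vector is a stand-in, not real detector output.
title_box = (100.0, 40.0, 500.0, 80.0)
augmented = with_context([0.2, 0.7], title_box, page_width=600.0, page_height=800.0)
print(augmented)
```

In a detector such as Faster R-CNN, a vector like this could in principle be joined to the pooled features of each region proposal before classification; whether the authors do so in this way is not stated in the abstract.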


Bibliographic Details

Source
International joint conference on natural language processing | 2019 | cxxxviii p. 3235-3881 | 7 pages
Authors
Carlos X. Soto; Shinjae Yoo
Original format
PDF
Chinese Library Classification
Programming; software engineering

Similar Literature


1. Visual Similarity Based Document Layout Analysis [J]. Di Wen, Xiao-Qing Ding. Journal of Computer Science & Technology. 2006, No. 3

2. Visual Similarity Based Document Layout Analysis [J]. Di Wen, Xiao-Qing Ding. Journal of Computer Science and Technology (English edition). 2006, No. 003

3. Text/Image Region Separation for Document Layout Detection of Old Document Images Using Non-linear Diffusion and Level Set [J]. S. Sachin Kumar, Parvathy Rajendran, P. Prabaharan. Procedia Computer Science. 2016, No. 1

4. Visual Detection with Context for Document Layout Analysis [C]. Carlos X. Soto, Shinjae Yoo. International joint conference on natural language processing; Conference on empirical methods in natural language processing. 2019

5. Network Visualization Literacy: Task, Context, and Layout [D]. Zoss, Angela Marie. 2018

6. Clinical Documents: Attribute-Values Entity Representation Context Page Layout And Communication [O]. Christian Lovis, Alexander Lamb, Robert Baud. 2003

7. Visual Detection with Context for Document Layout Analysis [O]. Carlos Soto, Shinjae Yoo. 2019
