Convolutional Neural Networks for Figure Extraction in Historical Technical Documents

Abstract

We present a method for extracting figures and images from the pages of scanned documents, especially technical research articles. Our approach is novel in two key ways. First, we treat this as a computer vision problem and train convolutional neural networks to recognize figures in scanned pages. Second, we generate our training data from 'born-digital' structured documents, which lets us produce labels for our training set automatically using PDF figure extractors. This avoids the otherwise tedious task of hand-labelling thousands of document pages. Our convolutional neural networks achieve precision and recall of close to 85% in identifying figures on a test set consisting of modern journal papers and conference proceedings, and obtain precision and recall above 80% on an application data set composed of historical technical documents scanned from the Bell Labs Records. Our results show that models trained on digital documents transfer very well to historical scans. Finally, it is easy to extend our models to identify other document elements such as tables and captions.
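The abstract reports precision and recall for figure detection but does not state how a predicted figure region is matched to a labelled one. As a rough illustration only, the sketch below scores predicted bounding boxes against labelled boxes using intersection-over-union (IoU) with a 0.5 threshold. The box format, the greedy matching, the threshold, and the function names `iou` and `precision_recall` are all assumptions made for this example, not details taken from the paper.

```python
from typing import List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in page pixel coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall(
    predicted: List[Box], truth: List[Box], threshold: float = 0.5
) -> Tuple[float, float]:
    """Greedily match each prediction to its best unmatched label.

    A prediction counts as a true positive when its best remaining
    label overlaps it with IoU >= threshold (an assumed criterion).
    """
    unmatched = list(truth)
    true_positives = 0
    for box in predicted:
        best = max(unmatched, key=lambda t: iou(box, t), default=None)
        if best is not None and iou(box, best) >= threshold:
            unmatched.remove(best)  # each label can be matched only once
            true_positives += 1
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall
```

Under these assumptions, the labelled boxes would come from a PDF figure extractor run on born-digital pages, and the predicted boxes from the trained network run on the rendered or scanned page image.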


Bibliographic details

Source
IAPR International Conference on Document Analysis and Recognition, 2017, pp. 789-795 (7 pages)
Authors
Chun-Nam Yu; Caleb Carson Levy; Iraj Saniee
Original format: PDF
Keywords
Portable document format; Training data; Layout; Training; Convolutional neural networks; Data mining


