Automatic Static/Variable Content Separation in Administrative Document Images

机译：行政文档图像中的自动静态/变量内容分离

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present an automatic method for separating static and variable content from administrative document images. An alignment approach is able to unsupervisedly build probabilistic templates from a set of examples of the same document kind. Such templates define which is the likelihood of every pixel of being either static or variable content. In the extraction step, the same alignment technique is used to match an incoming image with the template and to locate the positions where variable fields appear. We validate our approach on the public NIST Structured Tax Forms Dataset.

机译：在本文中，我们提出了一种自动方法，用于将静态和可变内容从管理文档图像分离。对准方法能够从同一文档类型的一组示例中毫无化地构建概率模板。这样的模板定义了哪个是静态或可变内容的每个像素的可能性。在提取步骤中，相同的对准技术用于将传入图像与模板匹配，并定位出现可变字段的位置。我们在公共NIST结构纳税表格数据集上验证了我们的方法。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|732p|共6页
会议地点
作者
David Aldavert; Mar?al Rusi?ol; Ricardo Toledo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Computational modeling; Probabilistic logic; Transforms; Task analysis; Information retrieval; Estimation; Heuristic algorithms;

机译：计算建模;概率逻辑;变换;任务分析;信息检索;估计;启发式算法;

相似文献

外文文献
中文文献
专利

1. Automatic Web Image Categorization by Image Content:A case study with Web Document Images [J] . Dr. Murugappan. S, Abirami S, Mizpha Poorana Selvi S International Journal on Computer Science and Engineering . 2010,第3期

机译：基于图像内容的Web图像自动分类：以Web文档图像为例
2. Separation of mixed Document Images in Farsi Scanned Documents Using Blind Source Separation [J] . Farbod Razzazi, Hossein Ghanbarloo, Shahpur Alirezaei International Journal of Image Processing . 2010,第4期

机译：使用盲源分离分离波斯扫描文档中的混合文档图像
3. Patent Issued for Image Forming System Capable of Causing Document Box Information of the Printer Driver to Automatically Adjust to a Change in the Document Box Information Adjust to a Change in the Document Box Information That Is Stored in an Image [J] . Journal of Engineering . 2013,第13期

机译：授予图像形成系统的专利，该图像形成系统能够使打印机驱动程序的文件夹信息自动适应于文件夹信息的变化，适应于存储在图像中的文件夹信息的变化
4. Automatic Static/Variable Content Separation in Administrative Document Images [C] . David Aldavert, Marçal Rusiñol, Ricardo Toledo IAPR International Conference on Document Analysis and Recognition . 2017

机译：管理文档图像中的自动静态/可变内容分离
5. Implicit models for automatic pose estimation in static images [D] . Holt, B. D. 2015

机译：静态图像中自动姿态估计的隐含模型
6. Ancient administrative handwritten documents: X-ray analysis and imaging [O] . F. Albertin, A. Astolfo, M. Stampanoni, -1

机译：古代行政手写文件：X射线分析和成像
7. Hierarchical content classification and script determination for automatic document image processing [O] . Chi Z, Wang Q, Siu WC 2003

机译：用于自动文档图像处理的分层内容分类和脚本确定

Automatic Static/Variable Content Separation in Administrative Document Images

摘要

著录项

相似文献

相关主题

期刊订阅