Text, photo, and line extraction in scanned documents

M. Sezer Erkilinc; Mustafa Jaber; Eli Saber; Peter Bauer; Dejan Depalov

Abstract

We propose a page layout analysis algorithm to classify a scanned document into different regions such as text, photo, or strong lines. The proposed scheme consists of five modules. The first module performs several image preprocessing techniques such as image scaling, filtering, color space conversion, and gamma correction to enhance the scanned image quality and reduce the computation time in later stages. Text detection is applied in the second module wherein wavelet transform and run-length encoding are employed to generate and validate text regions, respectively. The third module uses a Markov random field based block-wise segmentation that employs a basis vector projection technique with maximum a posteriori probability optimization to detect photo regions. In the fourth module, methods for edge detection, edge linking, line-segment fitting, and Hough transform are utilized to detect strong edges and lines. In the last module, the resultant text, photo, and edge maps are combined to generate a page layout map using K-Means clustering. The proposed algorithm has been tested on several hundred documents that contain simple and complex page layout structures and contents such as articles, magazines, business cards, dictionaries, and newsletters, and compared against state-of-the-art page-segmentation techniques with benchmark performance. The results indicate that our methodology achieves an average of approximately 89% classification accuracy in text, photo, and background regions.
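As a rough illustration of two of the building blocks named in the abstract, the sketch below applies gamma correction to 8-bit gray levels (one of the preprocessing steps) and run-length encodes a binarized scanline (the encoding used in the text-validation step). The abstract gives no parameters or decision rules, so the function names, the gamma value of 2.2, and the `looks_like_text` thresholds are all assumptions made for illustration, not the authors' method.

```python
from itertools import groupby


def gamma_correct(pixels, gamma=2.2):
    """Gamma-correct 8-bit gray levels (0..255); gamma=2.2 is an assumed value."""
    return [round(255 * (p / 255) ** (1 / gamma)) for p in pixels]


def run_lengths(bits):
    """Run-length encode a binary scanline as (value, length) pairs."""
    return [(value, len(list(group))) for value, group in groupby(bits)]


def looks_like_text(bits, max_run=4, min_runs=6):
    """Hypothetical validation rule (not from the paper): a text-like row
    alternates often and contains only short runs of ink pixels (value 1)."""
    runs = run_lengths(bits)
    ink_runs = [length for value, length in runs if value == 1]
    return len(runs) >= min_runs and bool(ink_runs) and max(ink_runs) <= max_run


# A toy binarized scanline: 1 = ink, 0 = background.
scanline = [0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 0, 0, 1, 0]

print(run_lengths(scanline))
print(looks_like_text(scanline))   # short, frequent ink runs
print(looks_like_text([1] * 14))   # one long solid run, e.g. a photo or rule
```

In a full pipeline, a rule of this kind would be applied to candidate regions produced by an earlier detector (the abstract names a wavelet transform for that role), and the thresholds would need tuning to the scan resolution.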

Bibliographic details

Source: Journal of Electronic Imaging, 2012, Issue 3, pp. 033006.1-033006.18 (18 pages)

Authors: M. Sezer Erkilinc; Mustafa Jaber; Eli Saber; Peter Bauer; Dejan Depalov

Author affiliations:

University College London, Department of Electronic and Electrical Engineering, Optical Networks Group, Torrington Place, London WC1E 7JE, United Kingdom;

IPPLEX Holdings Corporation, Santa Monica, California 90025;

Rochester Institute of Technology, Department of Electrical and Microelectronic Engineering, Rochester, New York 14623;

Hewlett-Packard Corporation, Imaging Asset Team, Boise, Idaho 83714;

Hewlett-Packard Corporation, Imaging Asset Team, Boise, Idaho 83714

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Date added to database: 2022-08-18 01:17:45

Similar literature

1. Review of Text Extraction Algorithms for Scene-text and Document Images [J]. Sahare Parul, Dhok Sanjay B. IETE Technical Review. 2017, Issue 2
2. Handheld Mobile Device Based Text Region Extraction and Binarization of Image Embedded Text Documents [J]. Ayatullah Faruk Mollah, Subhadip Basu, Mita Nasipuri. Journal of Intelligent Systems. 2013, Issue 1
3. Deep Text Mining for Automatic Keyphrase Extraction from Text Documents [J]. Muhammad Abulaish, Jahiruddin, Lipika Dey. Journal of Intelligent Systems. 2011, Issue 4
4. Simultaneous Optimisation of Image Quality Improvement and Text Content Extraction from Scanned Documents [C]. Shashank Mujumdar, Nitin Gupta, Abhinav Jain. International Conference on Document Analysis and Recognition. 2019
5. Markov random field model based text segmentation and image post processing of complex scanned documents [D]. Haneda, Eri. 2011
6. A System for Automated Extraction of Metadata from Scanned Documents using Layout Recognition and String Pattern Search Models [O]. Dharitri Misra, Siyuan Chen, George R. Thoma
7. A goal-oriented verification-based approach for target text line extraction from a document image captured by a pen scanner [O]. Bai Z, Huo Q. 2004
8. Extractions of Garment Manufacturing Data from 3D Whole Body Scans [R]. McLean, M. L., Newsom, B. 1998
