A document image segmentation system using analysis of connected components

Abstract

Segmenting a page into text and non-text elements is an essential preprocessing step before optical character recognition (OCR). When segmentation is poor, the OCR classification engine produces garbage characters because non-text elements are present. This paper presents a method for separating the textual and non-textual components of document images using graph-based modeling and structural analysis. The method is fast and efficient, and it adequately separates the graphical and textual parts of a document. We evaluated our method on two well-known datasets: the UW-III dataset and the ICDAR 2009 page segmentation competition dataset. We compare it with two state-of-the-art methods; the results show that our method performs better on this task.
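The abstract does not describe the graph-based modeling or the structural analysis in detail, so neither is reproduced here. As a minimal sketch of the connected-component extraction step that the title refers to, the following illustrates one common approach: breadth-first labeling of 8-connected foreground pixels in a binary image. The function name `connected_components`, the choice of 8-connectivity, and the toy `page` array are assumptions made for illustration, not the authors' implementation.

```python
from collections import deque


def connected_components(image):
    """Find 8-connected foreground components in a binary image.

    `image` is a list of rows containing 0 (background) or 1 (foreground).
    Returns one tuple per component, in row-major discovery order:
    (top, left, bottom, right, pixel_count).
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right, count = r, c, r, c, 0
            while queue:
                y, x = queue.popleft()
                count += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                # Visit all eight neighbours (the centre pixel is already seen).
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < height and 0 <= nx < width
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right, count))
    return boxes


# Hypothetical 5x5 binary "page" with three separate blobs.
page = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
]
boxes = connected_components(page)
for box in boxes:
    print(box)
```

Bounding boxes and pixel counts of this kind are typical inputs to a later text/non-text decision; how the paper builds its graph over the components is not stated in the abstract.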

Bibliographic details

Source
International Conference on Document Analysis and Recognition | 2013 | 5 pages
Authors
F. Zirari; A. Ennaji; S. Nicolas; D. Mammass
Original format: PDF
Chinese Library Classification: TP391.41-53
Keywords
Text/non-text separating; Connected components; Graph; Structural analysis; Document image

Similar literature

1. Integrated technique of segmentation and classification methods with connected components analysis for road extraction from orthophoto images [J]. Abdollahi Abolfazl, Pradhan Biswajeet. Expert Systems with Applications. 2021, No. Auga

2. Touching text line segmentation combined local baseline and connected component for Uchen Tibetan historical documents [J]. Pengfei Hu, Weilan Wang, Qiaoqiao Li. Information Processing & Management. 2021, No. 6

3. Image-based clustering and connected component labeling for rapid automated left and right ventricular endocardial volume extraction and segmentation in full cardiac cycle multi-frame MRI images of cardiac patients [J]. Goyal Ayush. Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering. 2019, No. 6

4. A Document Image Segmentation System Using Analysis of Connected Components [C]. Zirari F., Ennaji A., Nicolas S., et al. International Conference on Document Analysis and Recognition. 2013

5. Parameter-dependent connected component of gray images and image understanding, segmentation and stereo correspondence [D]. Wang, Yang. 1997

6. Multithreaded two-pass connected components labelling and particle analysis in ImageJ [O]. Michael Doube. 2021

7. Text Line Segmentation for Unconstrained Handwritten Document Images Using Neighborhood Connected Component Analysis [O]. Abhishek Khandelwal, Pritha Choudhury, Ram Sarkar. 2009

8. Efficient Segmentation of Geophysical Field Images on Basis of Independent Component Analysis [R]. Mironenko, A., Akhmetshin, A. M., Akhmetshina, L. G. 2005
