首页> 外文会议> >An edge-based block segmentation and classification for document analysis with automatic character string extraction

【24h】

An edge-based block segmentation and classification for document analysis with automatic character string extraction

机译：基于边缘的块分割和分类，用于具有自动字符串提取功能的文档分析

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Presents an edge-based block segmentation and classification with automatic character string extraction for document analysis. By exploiting only four edge features from the gradient and the orientation of the edge pixels, we can make the block segmentations, classifications, and the character string extractions all insensitive to the background noise and the brightness variation of the image. We can efficiently classify a document image into seven categories of small-sized letters, large-sized letters, tables, equations, flow charts, graphs, and photographs, the first five of which are text or character blocks containing characters, and the last two are non-character blocks. We can obtain an efficient block segmentation with reduced memory size by introducing the column and the text line intervals of the document in CRLA (constrained run length algorithm). The simulation results show that an efficient document image segmentation, block classification, and the character string extraction can be done concurrently.

机译：呈现基于边缘的块分段和分类，具有用于文档分析的自动字符串提取。通过从梯度和边缘像素的方向仅利用四个边缘特征，我们可以制作块分割，分类和字符串提取，对背景噪声和图像的亮度变化进行了不敏感。我们可以有效地将文档图像分类为七个类别的小型字母，大型字母，表，方程，流程图，图形和照片，其中前五个是包含字符的文本或字符块，以及最后两个是非字符块。通过在CRLA中引入列和文档的文本线间隔，我们可以获得具有减小的内存大小的有效块分割，并在CRLA中的文本（受限的运行长度算法）。仿真结果表明，可以同时进行高效的文档图像分割，块分类和字符串提取。

著录项

来源
《》|1996年|P.707-712|共6页
会议地点
作者
Chang-Joon Park; Joon-Hyung Jeon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Fully automatic ROI extraction and edge-based segmentation of radius and ulna bones from hand radiographs [J] . Shreyas Simu, Shyam Lal, Pranav Nagarsekar, Biocybernetics and biomedical engineering . 2017,第4期

机译：从手射线照相中全自动ROI提取和基于边缘的半径和尺骨骨骼的分割
2. AUTOMATIC DOCUMENT SKEW PRE-PROCESSOR FOR CHARACTER SEGMENTATION ALGORITHM [J] . Vladan Vu?kovi?, Boban Arizanovic Facta Universitatis. Series Electronics and Energetics . 2017,第4期

机译：字符分割算法的自动文档偏斜预处理器
3. Character string extraction from color documents [J] . Hase H., Yoneda M., Suen CY., Pattern Recognition: The Journal of the Pattern Recognition Society . 2001,第7期

机译：从彩色文档中提取字符串
4. An edge-based block segmentation and classification for document analysis with automatic character string extraction [C] . Chang-Joon Park, Joon-Hyung Jeon, Institute of Electric and Electronic Engineer IEEE International Conference on Systems . 1996

机译：自动字符串提取的基于边缘的块分段和文档分析分类
5. Noun phrases in documents: Preprocessing, automatic extraction, and statistical analysis in different categories of text. [D] . Kim, Youngin. 2002

机译：文档中的名词短语：对不同类别的文本进行预处理，自动提取和统计分析。
6. A System for Automated Extraction of Metadata from Scanned Documents using Layout Recognition and String Pattern Search Models [O] . Dharitri Misra, Siyuan Chen, George R. Thoma -1

机译：使用布局识别和字符串模式搜索模型从扫描文档中自动提取元数据的系统
7. Automatic classification of documents with an in-depth analysis of information extraction and automatic summarization [O] . Hohm Joseph Brandon 1982- 2004

机译：通过对信息提取和自动摘要的深入分析，自动对文档进行分类

An edge-based block segmentation and classification for document analysis with automatic character string extraction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅