A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents

机译：一种快速有效的方法，用于从无约束文件中提取文本段落和图形

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Outlines a fast and efficient method for extracting graphics and text paragraphs from printed documents. The method presented is based on bottom-up approach to document analysis and it achieves very good performance in most cases. During the preprocessing characters are linked together to form blocks. Created blocks are segmented, labelled and merged into paragraphs. Simultaneously, graphics are extracted from the image. Algorithms for each step of processing are presented. Also, the obtained experimental results are included.

机译：概述了从打印文件中提取图形和文本段落的快速有效方法。所呈现的方法是基于对文档分析的自下而上的方法，并且在大多数情况下实现了非常好的性能。在预处理字符期间，链接在一起形成块。已创建的块被分段，标记并合并到段落中。同时，从图像中提取图形。提出了每个处理步骤的算法。此外，包括所获得的实验结果。

著录项

来源
《IAPR International Conference on Pattern Recognition》|1992年||共5页
会议地点
作者
Lebourgeois F.; Bublinski Z.; Institute of Electric and Electronic Engineer;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A knowledge-based system for extracting text-lines from mixed and overlapping text/graphics compound document images [J] . Yen-Lin Chen, Zeng-Wei Hong, Cheng-Hung Chuang Expert systems with applications . 2012,第1期

机译：基于知识的系统，用于从混合和重叠的文本/图形复合文档图像中提取文本行
2. Identification of Biological Relationships from Text Documents Using Efficient Computational Methods [J] . Mathew Palakal, Matthew Stephens, Snehasis Mukhopadhyay, Journal of Bioinformatics and Computational Biology . 2003,第2期

机译：使用有效的计算方法从文本文档中识别生物学关系
3. Identification of Biological Relationships from Text Documents Using Efficient Computational Methods [J] . Mathew Palakal, Matthew Stephens, Snehasis Mukhopadhyay, Journal of Bioinformatics and Computational Biology . 2003,第2期

机译：使用有效的计算方法从文本文档中识别生物学关系
4. A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents [C] . Lebourgeois, F., Bublinski, . 1992

机译：从不受约束的文档中提取文本段落和图形的快速有效方法
5. Fast unconstrained tomosynthesis reconstruction: Methods and applications. [D] . Wang, Beilei. 2005

机译：快速不受约束的断层合成重建：方法和应用。
6. Combining Position Weight Matrices and Document-Term Matrix for Efficient Extraction of Associations of Methylated Genes and Diseases from Free Text [O] . Arwa Bin Raies, Hicham Mansour, Roberto Incitti, -1

机译：结合位置权重矩阵和文档项矩阵从自由文本中高效提取甲基化基因与疾病的关联
7. A method for comparing text and graphic fragments in electronic documents using a hybrid criterion. [O] . S.G. Udovenko, L.E. Chala, E.S. Kushvid 2019

机译：一种使用混合标准将文本和图形片段进行比较的方法。
8. Fast Bundle-Level Type Methods for Unconstrained and Ball-Constrained Convex Optimization. [R] . Chen, Y., Lan, G., Ouyang, Y., 2014

机译：无约束和球约束凸优化的快速束级方法。

A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents

摘要

著录项

相似文献

相关主题

期刊订阅