Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation

Azaedeh Nazemi; Iain Murray; David A. McMeekin

首页> 外文期刊>Computer and information science >Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation

【24h】

Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation

机译：扫描PDF的布局分析并转换为适合语音和导航的结构化PDF

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Information can include text, pictures and signatures that can be scanned into a document format, such as the Portable Document Format (PDF), and easily emailed to recipients around the world. Upon the document's arrival, the receiver can open and view it using a vast array of different PDF viewing applications such as Adobe Reader and Apple Preview. Hence, today the use of the PDF has become pervasive. Since the scanned PDF is an image format, it is inaccessible to assistive technologies such as a screen reader. Therefore, the retrieval of the information needs Optical Character Recognition (OCR). The OCR software scans the scanned PDF file and through text extraction generates an editable text formatted document. This text document can then be edited, formatted, searched and indexed as well as translated or converted to speech. A problem that the OCR software does not solve is the accurate regeneration of the full text layout. This paper presents a technology that addresses this issue by closely preserving the original textual layout of the scanned PDF using the open source document analysis and OCR system (OCRopus) based on geometric layout and positioning information. The main issues considered in this research are the preservation of the correct reading order, and the representation of common logical structured elements such as section headings, line breaks, paragraphs, captions, and sidebars, foot-bars, running headers, embedded images, graphics, tables and mathematical expressions.

机译：信息可以包括文本，图片和签名，可以将其扫描成文档格式，例如可移植文档格式（PDF），并可以通过电子邮件轻松地发送给世界各地的收件人。收到文档后，接收者可以使用各种不同的PDF查看应用程序（例如Adobe Reader和Apple Preview）打开并查看它。因此，今天，PDF的使用已变得无处不在。由于扫描的PDF是图像格式，因此诸如屏幕阅读器之类的辅助技术无法访问它。因此，信息的检索需要光学字符识别（OCR）。 OCR软件扫描扫描的PDF文件，并通过文本提取生成可编辑的文本格式文档。然后可以对该文本文档进行编辑，格式化，搜索和索引以及翻译或转换为语音。 OCR软件无法解决的问题是准确重新生成全文版式。本文提出了一种技术，通过使用开放式文档分析和基于几何布局和位置信息的OCR系统（OCRopus）来紧密保留扫描的PDF的原始文本布局，从而解决了这一问题。本研究中考虑的主要问题是保持正确的阅读顺序，以及常见逻辑结构元素的表示，例如节标题，换行符，段落，标题和侧边栏，脚栏，运行标题，嵌入的图像，图形，表格和数学表达式。

著录项

来源
《Computer and information science》 |2014年第1期|162-171|共10页
作者
Azaedeh Nazemi; Iain Murray; David A. McMeekin;
展开▼
作者单位

Departent of Electrical and Computer Engineering, Curtin University, Perth,WA, Australia;

Department of Spatial Sciences, Curtin University, Perth, WA, Australia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
optical character recognition; document layout analysis; assistive technology;

机译：光学字符识别;文件布局分析;辅助技术;

相似文献

外文文献
中文文献
专利

1. Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation [J] . Azaedeh Nazemi, Iain Murray, David A. McMeekin Computer and Information Science . 2014,第1期

机译：扫描PDF的布局分析并转换为适合语音和导航的结构化PDF
2. Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation [J] . Computer and Information Science . 2014,第1期

机译：扫描PDF的布局分析并转换为适合语音和导航的结构化PDF
3. Elucidation of structure and nature of the PdO-Pd transformation using in situ PDF and XAS techniques [J] . Jonathan Keating, Gopinathan Sankar, Timothy I. Hyde Physical chemistry chemical physics: PCCP . 2013,第22期

机译：使用原位PDF和XAS技术阐明PdO-Pd转化的结构和性质
4. Using Layout Applications for Creation of Accessible PDF: Technical and Mental Obstacles When Creating PDF/UA from Adobe Indesign CS 5.5 [C] . Olaf Druemmer International conference on computers helping people with special needs . 2012

机译：使用布局应用程序创建可访问的PDF：从Adobe Indesign CS 5.5创建PDF / UA时的技术和心理障碍
5. An Analysis of STEP, JT, and PDF Format Translation Between Constraint-based Cad Systems with a Benchmark Model. [D] . McKenzie-Veal, Dillon. 2012

机译：基于基准模型的基于约束的Cad系统之间的STEP，JT和PDF格式转换分析。
6. Layout-aware text extraction from full-text PDF of scientific articles [O] . Cartic Ramakrishnan, Abhishek Patnia, Eduard Hovy, 2012

机译：从科学文章的全文PDF中提取可识别布局的文本
7. Layout analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation [O] . Azaedeh Nazemi, Iain Murray, David A. Mcmeekin 2014

机译：扫描pDF的布局分析和适用于发声和导航的结构化pDF的转换

Layout Analysis for Scanned PDF and Transformation to the Structured PDF Suitable for Vocalization and Navigation

摘要

著录项

相似文献

相关主题

期刊订阅