Journal: Pattern Recognition: The Journal of the Pattern Recognition Society

A generic method for determining up/down orientation of text in roman and non-roman scripts


Abstract

This paper presents a method for determining the up/down orientation of text in a scanned document of unknown orientation, so that it can be appropriately rotated and processed by an optical character recognition (OCR) engine. The method analyzes the "open" portions of text blobs to determine the direction in which the open portions face. By determining the respective densities of blobs opening in a pair of opposite directions (e.g., right or left), the method can establish the direction in which the text as a whole is oriented. We first describe a method for determining the up/down orientation of roman text based on the asymmetry in the openness of most roman letters in the horizontal direction. For non-roman text such as Pashto and Hebrew, we provide a method that determines a direction that is the most asymmetric, and therefore the most useful for the determination of text orientation, given a training data set of documents of known orientation. This work can be adapted for use in automated mail processing or to determine the orientation of checks in automated teller machine envelopes, scanned or copied documents, documents sent via facsimile, and digital photographs that include text (e.g., road signs, business cards, driver's licenses), among other applications. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
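
The idea of counting blobs by the direction in which they open can be illustrated with a small sketch. This is not the paper's implementation: the row-gap measure of "openness", the function names, and the default assumption that upright roman text has more right-opening than left-opening blobs are all simplifications introduced here for illustration. Blobs are assumed to be binary grids (1 = ink, 0 = background) already cropped to their bounding boxes.

```python
from typing import List, Optional

Blob = List[List[int]]  # 1 = ink, 0 = background, cropped to the bounding box


def rotate180(blob: Blob) -> Blob:
    """Return the blob as it would appear on an upside-down page."""
    return [row[::-1] for row in reversed(blob)]


def opening_direction(blob: Blob) -> Optional[str]:
    """Crude openness proxy (an assumption, not the paper's measure):
    sum the background gap between each bounding-box side and the nearest
    ink pixel in every row, then report the side with the larger total."""
    left_gap = 0
    right_gap = 0
    for row in blob:
        if 1 not in row:
            continue  # skip rows without ink
        first = row.index(1)
        last = len(row) - 1 - row[::-1].index(1)
        left_gap += first
        right_gap += len(row) - 1 - last
    if right_gap > left_gap:
        return "right"
    if left_gap > right_gap:
        return "left"
    return None  # symmetric blob, carries no orientation evidence


def estimate_orientation(blobs: List[Blob], upright_dominant: str = "right") -> str:
    """Compare counts of left- and right-opening blobs.
    `upright_dominant` is a hypothetical parameter naming the direction
    assumed to dominate in correctly oriented text of the given script."""
    counts = {"left": 0, "right": 0}
    for blob in blobs:
        direction = opening_direction(blob)
        if direction is not None:
            counts[direction] += 1
    if counts["left"] == counts["right"]:
        return "undetermined"
    dominant = max(counts, key=counts.get)
    return "upright" if dominant == upright_dominant else "upside-down"


# Toy blobs: a "C"-like shape (opens right) and an "O"-like shape (symmetric).
C_SHAPE = [[1, 1, 1], [1, 0, 0], [1, 0, 0], [1, 1, 1]]
O_SHAPE = [[1, 1, 1], [1, 0, 1], [1, 1, 1]]

page = [C_SHAPE, C_SHAPE, O_SHAPE, rotate180(C_SHAPE)]
flipped_page = [rotate180(blob) for blob in page]
```

Under these assumptions, `estimate_orientation(page)` and `estimate_orientation(flipped_page)` give opposite answers, because rotating a page by 180 degrees swaps left-opening and right-opening blobs. For a script with a different dominant direction, `upright_dominant` would have to be set from documents of known orientation, in the spirit of the training step the abstract describes for non-roman text.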
