...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Classification of machine-printed and handwritten texts using character block layout variance
【24h】

Classification of machine-printed and handwritten texts using character block layout variance

机译:使用字符块布局差异对机器打印和手写文本进行分类

获取原文
获取原文并翻译 | 示例

摘要

Machine-printed and handwritten texts always intermixedly appear in several kinds of documents, such as form documents. The classification of machine-printed and handwritten texts is thus a prerequisite to facilitate later optical character recognition task. In this paper, we will present a machine-printed and handwritten text classification method to automatically identify the identity of texts segmented from a document image. In our approach, the orientation of a text block is first divided into horizontal or vertical direction by analyzing the widths of valleys of X and Y projection profiles of a text block image. Then, a reduced X-Y cut algorithm is utilized to obtain the base blocks from a text block image. Last, the spatial feature, character block layout variance, is devised to achieve the classification goal. Our method can be applied to either English or Chinese document images. Experimental results reveal the feasibility of our proposed method in classifying handwritten and machine-printed texts. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved. [References: 8]
机译:机器打印的文本和手写的文本总是混合出现在多种文档中,例如表格文档。因此,机器印刷和手写文本的分类是促进以后的光学字符识别任务的前提。在本文中,我们将提出一种机器打印和手写的文本分类方法,以自动识别从文档图像中分割出的文本的身份。在我们的方法中,首先通过分析文本块图像的X和Y投影轮廓的谷底宽度,将文本块的方向分为水平或垂直方向。然后,利用简化的X-Y剪切算法从文本块图像中获取基本块。最后,设计空间特征(字符块布局变化)以实现分类目标。我们的方法可以应用于英语或中文文档图像。实验结果表明,我们提出的方法可以对手写和机器打印的文本进行分类。 (C)1998模式识别学会。由Elsevier Science Ltd.出版。保留所有权利。 [参考:8]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号