Foreign-language conference: Conference on Information Technology and Quantitative Management

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text


Abstract

Character extraction is the most critical pre-processing step for any off-line text recognition system, because characters are the smallest unit of any language script. The paper proposes an approach for segmenting character images from text that contains images and computer-printed or handwritten words. The segmentation is based on a set of properties computed for each connected component (object) in the whole binary image of the machine-printed or handwritten text, which may also contain other images. The words, which are printed alongside images, vary in length and are printed in different cursive fonts of different sizes. The technique is applied to segment non-touching characters from machine-printed or handwritten words of varying length written on a noisy background that includes images. The authors report very promising results, which they take as evidence of the robustness of the proposed character detection and extraction technique.
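The abstract does not list which connected-component properties the authors use, so the following is only a minimal sketch of the general idea, not the paper's method. It labels 8-connected foreground regions in a binary image, records a few common properties (bounding box, area, height, width, extent), and keeps components whose size is plausible for a single character. The function names, the choice of properties, and the thresholds `min_area`, `max_height`, and `max_width` are all assumptions made for illustration.

```python
from collections import deque


def connected_components(binary):
    """Label 8-connected foreground (value 1) regions and return their properties."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            area = 0
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            height = bottom - top + 1
            width = right - left + 1
            components.append({
                "bbox": (top, left, bottom, right),
                "area": area,
                "height": height,
                "width": width,
                # Fraction of the bounding box covered by foreground pixels.
                "extent": area / (height * width),
            })
    return components


def character_candidates(components, min_area=3, max_height=5, max_width=5):
    """Keep components of character-like size (assumed thresholds), left to right."""
    kept = [
        comp for comp in components
        if comp["area"] >= min_area
        and comp["height"] <= max_height
        and comp["width"] <= max_width
    ]
    return sorted(kept, key=lambda comp: comp["bbox"][1])


# Toy image: two small glyphs, one large picture-like block, one noise pixel.
SAMPLE = [
    "................",
    ".#..##..########",
    ".#..#...########",
    ".#..##..########",
    "........########",
    "........########",
    "..#.....########",
    "................",
]
SAMPLE_BINARY = [[1 if ch == "#" else 0 for ch in row] for row in SAMPLE]

if __name__ == "__main__":
    for comp in character_candidates(connected_components(SAMPLE_BINARY)):
        print(comp["bbox"], comp["area"])
```

In this toy image the size limits reject both the large block (standing in for an embedded picture) and the single-pixel speck (standing in for background noise), leaving the two small glyphs. Real thresholds would have to be derived from the font sizes in the document, and touching characters would need additional handling that this sketch does not attempt.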
