Conference: International Conference on Pattern Recognition

Text-line extraction and character recognition of Japanese newspaper headlines with graphical designs

Abstract

Conventional OCR fails to recognize most characters in Japanese newspaper headlines with graphical designs because the designs are difficult to remove. This paper proposes a method that recognizes such characters without removing the designs. First, text-line regions are extracted from the local distribution of combinations of black and white runs observed in a rectangular window as the window is shifted pixel by pixel along the text-line direction. Characters in the extracted text-line regions are then recognized by displacement matching. Thresholding that adapts to the degree of degradation suppresses the spurious candidates that displacement matching yields, even in the presence of graphical designs. Experiments on fifty Japanese newspaper headlines show that the method achieves a recognition rate of 97.7%, much higher than that of a conventional method (17.0%).
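
The abstract does not specify how the run combinations are defined or how the window statistics are classified, so the following is only a minimal sketch of the general idea of collecting black/white run combinations in a window shifted one pixel at a time. It assumes a binarized image given as rows of 0/1 values (1 = black), a horizontal text line, and a "combination" meaning a black run immediately followed by a white run. The names `run_pairs` and `window_histograms` are hypothetical and do not come from the paper.

```python
from collections import Counter


def run_pairs(row):
    """Return (black_run, white_run) length pairs found in one pixel row.

    `row` is a sequence of 0/1 values, where 1 is assumed to mean black.
    A pair is recorded whenever a black run is directly followed by a
    white run (an assumed definition, not taken from the paper).
    """
    runs = []  # each entry is [pixel_value, run_length]
    for px in row:
        if runs and runs[-1][0] == px:
            runs[-1][1] += 1
        else:
            runs.append([px, 1])

    pairs = []
    for (v1, n1), (v2, n2) in zip(runs, runs[1:]):
        if v1 == 1 and v2 == 0:
            pairs.append((n1, n2))
    return pairs


def window_histograms(image, width):
    """Slide a window of `width` columns one pixel at a time.

    Returns one Counter per window position, holding the local
    distribution of (black_run, white_run) pairs inside that window.
    """
    n_cols = len(image[0])
    hists = []
    for x in range(n_cols - width + 1):
        hist = Counter()
        for row in image:
            hist.update(run_pairs(row[x:x + width]))
        hists.append(hist)
    return hists


if __name__ == "__main__":
    image = [[1, 1, 0, 0, 0, 1, 0]]
    for x, hist in enumerate(window_histograms(image, width=5)):
        print(x, dict(hist))
```

A later stage would compare each window's histogram against statistics typical of text strokes to decide whether the window lies inside a text line. That decision rule is not described in the abstract and is left out here.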