Text-line extraction and character recognition of Japanese newspaper headlines with graphical designs

机译：具有图形设计的日本报纸头条新闻的文本线提取与字符识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The conventional OCR fails to recognize most characters in Japanese newspaper headlines with graphical designs because of the difficulty of removing the designs. This paper proposes a method that recognizes such characters without removing the designs. First, text-line regions are extracted from a local distribution of the combination of black and white runs observed in a rectangular window while the window is shifted pixel-by-pixel in the direction of the text-line. Characters in the extracted text-line region are then recognized by displacement matching. Adaptive thresholding against the degree of degradation suppresses spurious candidates yielded by displacement matching even with graphical designs. Experimental results for fifty Japanese newspaper headlines show that the method achieves a recognition rate of 97.7%, much higher than a conventional method (17.0%).

机译：由于难以去除设计，传统的OCR未能识别日本报纸头条新闻中的大多数角色。本文提出了一种识别此类字符而不删除设计的方法。首先，在矩形窗口中观察到的黑白窗口组合的局部分布中提取文本线区域，而窗口在文本线的方向上移位像素逐个像素。然后通过位移匹配识别提取的文本线区域中的字符。抵抗降解程度的自适应阈值抑制了即使用图形设计也通过位移匹配产生的杂散候选。五十日报标题的实验结果表明，该方法达到97.7％的识别率，远高于常规方法（17.0％）。

著录项

来源
《International Conference on Pattern Recognition》|1996年||共6页
会议地点
作者
Sawaki M.; Hagita N.; Institute of Electric and Electronic Engineer;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Text-line extraction and character recognition of document headlines with graphical designs using complementary similarity measure [J] . Sawaki M., Hagita N. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1998,第10期

机译：使用互补相似性度量的图形设计文本标题的文本行提取和字符识别
2. Text-Line and Character Segmentation for Off-line Recognition of Handwritten Japanese Text [J] . Kha Cong Nguyen, Nakagawa Masaki 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2015,第517期

机译：文本行和字符分割，用于手写日语文本的离线识别
3. Extraction of newspaper headlines from microfilm for automatic indexing [J] . Chew Lim Tan, Qing Hong Liu International Journal on Document Analysis and Recognition . 2004,第3期

机译：从缩微胶卷中提取报纸头条以进行自动索引
4. Text-line extraction and character recognition of Japanese newspaper headlines with graphical designs [C] . Sawaki, M., Hagita, . 1996

机译：带有图形设计的日本报纸标题的文本行提取和字符识别
5. An Optical Character Recognition Engine for Graphical Processing Units. [D] . Reed, Jeremy. 2016

机译：用于图形处理单元的光学字符识别引擎。
6. An Outbreak of Fearsome Photos and Headlines: Ebola and Local Newspapers in West Africa [O] . Eric S. Halsey 2016

机译：令人恐惧的照片和头条新闻爆发：埃博拉病毒和西非当地报纸
7. Text-Line Extraction and Character Recognition of Document Headlines With Graphical Designs Using Complementary Similarity Measure [O] . Minako Sawaki 1998

机译：基于互补相似性度量的图形设计文本标题文本行提取与字符识别

Text-line extraction and character recognition of Japanese newspaper headlines with graphical designs

摘要

著录项

相似文献

相关主题

期刊订阅