A new method of character line extraction from mixed-unformatted document image for Japanese mail address recognition

机译：用于日语邮件地址识别的混合 - 未格式化文档图像的一种新的字符线提取方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Presents a new method of horizontal and vertical character line extraction in mixed (handwritten/printed) unformatted document images, in various character sizes, gaps and orientations nested among advertisement characters, drawings and photographs. We use the inherent features of a character line, such as the number and size of the characters it contains and the angular spectrum of the characters. When an area has characters along both horizontal and vertical lines, then competitive judgment is applied. Using multi-set thresholds in a bottom-up methodology, we can successfully extract Japanese mail address character lines. 957 address character lines, taken from 252 pieces of mail, were tested, and a 95.9% correct extraction rate was achieved.

机译：呈现了混合（手写/打印）未格式化的文档图像中的水平和垂直字符线提取的新方法，以各种字符尺寸，间隙和嵌套在广告字符，图纸和照片中的方向。我们使用字符行的固有功能，例如它包含的字符的数量和大小以及字符的角谱。当一个区域沿水平和垂直线的角色有角色，然后应用竞争判断。在自下而上的方法中使用多个阈值，我们可以成功提取日语地址字符线。从252件邮件中取出的957个地址字符线，实现了95.9％的正确提取率。

著录项

来源
《International Conference on Document Analysis and Recognition》|1999年||共4页
会议地点
作者
Xian Wang; Tsutsumida T.; Institute of Electric and Electronic Engineer;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Constraint Satisfaction Approach to Extraction of Japanese Character Regions from Unformatted Document Image [J] . Keiji GYOHTEN, Noboru BABAGUCHI, Tadahiro KITAHASHI IEICE Transactions on Information and Systems . 1995,第4期

机译：从无格式文档图像中提取日语字符区域的约束满足方法
2. Synthetic Scene Character Generator and Ensemble Scheme with the Random Image Feature Method for Japanese and Chinese Scene Character Recognition [J] . Fuma HORIE, Hideaki GOTO, Takuo SUGANUMA IEICE transactions on information and systems . 2021,第11期

机译：综合场景字符生成器和与日语场景字符识别随机图像特征方法的合成场景
3. Character extraction and recognition for low-resolution color images using dominant-color-based-line-segment method [J] . Masahiko Hamanaka 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2001,第525期

机译：基于主色线段法的低分辨率彩色图像字符提取与识别
4. A new method of character line extraction from mixed-unformatted document image for Japanese mail address recognition [C] . Xian Wang, Tsutsumida, T. . 1999

机译：从混合无格式文档图像中提取字符行的新方法，用于日语邮件地址识别
5. Use of character recognition and syntax in locating address paragraphs in complex documents. [D] . Lii, Jenchyou. 1995

机译：在复杂文档中查找地址段落时使用字符识别和语法。
6. Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition [O] . Hubert Michalak, Krzysztof Okarma 2020

机译：非均匀照明文档图像的鲁棒组合二值化方法用于字母数字字符识别
7. Study on feature extraction methods for character recognition of Balinese script on palm leaf manuscript images [O] . Made Windu Antara Kesiman, Sophea Prum, Jean-Christophe Burie, 2016

机译：棕榈叶手稿图像上巴厘岛脚本字符识别特征提取方法研究
8. Some methods of encoding simple visual images for use with a sparse distributed memory, with applications to character recognition [R] . Jaeckel, Louis A. 1989

机译：一些编码简单视觉图像的方法，用于稀疏分布式存储器，应用于字符识别

A new method of character line extraction from mixed-unformatted document image for Japanese mail address recognition

摘要

著录项

相似文献

相关主题

期刊订阅