Training of an on-line handwritten Japanese character recognizer by artificial patterns

Bin Chen; Bilan Zhu; Masaki Nakagawa

首页> 外文期刊>Pattern recognition letters >Training of an on-line handwritten Japanese character recognizer by artificial patterns

【24h】

Training of an on-line handwritten Japanese character recognizer by artificial patterns

机译：通过人工模式训练在线手写日语字符识别器

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents effects of a large amount of training patterns artificially generated to train an on-line handwritten Japanese character recognizer, which is based on the Markov Random Field model. In general, the more training patterns, the higher the recognition accuracy. In reality, however, the existing pattern samples are not enough, especially for languages with large sets of characters, for which a higher number of parameters needs to be adjusted. We use six types of linear distortion models and combine them among themselves and with a non-linear distortion model to generate a large amount of artificial patterns. These models are based on several geometry transform models, which are considered to simulate distortions in real handwriting. We apply these models to the TUAT Nakayosi database and expand its volume by up to 300 times while evaluating the notable effect of the TUAT Kuchibue database for improving recognition accuracy. The effect is analyzed for subgroups in the character set and a significant effect is observed for Kanji, ideographic characters of Chinese origin. This paper also considers the order of linear and non-linear distortion models and the strategy to select patterns in the original database from patterns close to character class models to those away from them or vice versa. For this consideration, we merge the Nakayosi and Kuchibue databases. We take 100 patterns existed in the merged database to form the testing set, while the remaining samples to form the training set. For the order, linear then non-linear distortions produce higher recognition accuracy. For the strategy, selecting patterns away from character class models to those close to them produce higher accuracy.

机译：本文介绍了基于Markov Random Field模型人工生成的大量训练模式对在线手写日语字符识别器的训练效果。通常，训练模式越多，识别精度越高。然而，实际上，现有的模式样本是不够的，特别是对于具有大量字符集的语言而言，需要调整大量参数。我们使用六种类型的线性失真模型，并将它们相互之间以及与非线性失真模型结合起来以生成大量的人造图案。这些模型基于几种几何变换模型，这些模型被认为可以模拟真实笔迹中的变形。我们将这些模型应用于TUAT Nakayosi数据库，并将其数量最多扩展300倍，同时评估TUAT Kuchibue数据库在提高识别准确性方面的显着效果。对字符集中的子组进行了效果分析，并且对汉字汉字（表意文字）观察到了显着效果。本文还考虑了线性和非线性失真模型的顺序，以及在原始数据库中从接近字符类模型的模型到远离字符类模型的模型选择模型的策略，反之亦然。为此，我们合并了Nakayosi和Kuchibue数据库。我们采用合并数据库中存在的100个模式来形成测试集，而其余样本来形成训练集。对于此顺序，线性然后是非线性失真会产生更高的识别精度。对于该策略，从角色类模型到与其接近的模型选择模式会产生更高的准确性。

著录项

来源
《Pattern recognition letters》 |2014年第1期|178-185|共8页
作者
Bin Chen; Bilan Zhu; Masaki Nakagawa;
展开▼
作者单位

Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Tokyo, Japan;

Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Tokyo, Japan;

Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Tokyo, Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
On-line handwriting recognition; Artificial patterns; Linear distortion models; Non-linear distortion model; Combined model; Pattern selection strategy;

机译：在线手写识别;人造图案;线性失真模型;非线性失真模型;组合模型;模式选择策略;

相似文献

外文文献
中文文献
专利

1. A robust method for coarse classifier construction from a large number of basic recognizers for on-line handwritten Chinese/Japanese character recognition [J] . Bilan Zhu, Masaki Nakagawa Pattern Recognition: The Journal of the Pattern Recognition Society . 2014,第2期

机译：一种从大量基本识别器构造粗分类器的可靠方法，用于在线手写中日文字识别
2. Prototype learning for structured pattern representation applied to on-line recognition of handwritten Japanese characters [J] . Akihito Kitadai, Masaki Nakagawa International Journal on Document Analysis and Recognition . 2007,第2期

机译：用于结构化模式表示的原型学习应用于手写日语字符的在线识别
3. Collection of on-line handwritten Japanese character pattern databases and their analyses [J] . Masaki Nakagawa, Kaoru Matsumoto International Journal on Document Analysis and Recognition . 2004,第1期

机译：在线手写日语字符模式数据库的收集及其分析
4. Effects of Generating a Large Amount of Artificial Patterns for On-line Handwritten Japanese Character Recognition [C] . Chen Bin, Zhu Bilan, Nakagawa Masaki 2011 International Conference on Document Analysis and Recognition . 2011

机译：生成大量人工图案对在线手写日语字符识别的影响
5. Handwritten character recognition by combining multiple recognizers using conditional probabilities [D] . Padmanaban, Mageshkumar 2005

机译：通过使用条件概率组合多个识别器来进行手写字符识别
6. Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks [O] . Md Zahangir Alom, Paheding Sidike, Mahmudul Hasan, 2018

机译：使用最先进的深度卷积神经网络进行手写Bangla字符识别
7. Collection and Analysis of On-Line Handwritten Japanese Character Patterns [O] . Kaoru Matsumoto, Takahiro Fukushima, Masaki Nakagawa 2001

机译：在线手写日语字符模式的收集和分析

Training of an on-line handwritten Japanese character recognizer by artificial patterns

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅