首页> 外文期刊>International Journal on Document Analysis and Recognition >A sigma-lognormal model-based approach to generating large synthetic online handwriting sample databases
【24h】

A sigma-lognormal model-based approach to generating large synthetic online handwriting sample databases

机译:基于sigma-lognormal模型的方法来生成大型合成在线手写样本数据库

获取原文
获取原文并翻译 | 示例
           

摘要

This article describes a methodology to generate a large database of synthetic samples from a small set of original online handwriting specimens. The overall paradigm is based on the Kinematic Theory of rapid human movements and its sigma-lognormal model. The principal contributions of the present study include (i) development of a strategy for sigma-lognormal model-based generation of synthetic samples from real online handwriting samples of arbitrary scripts captured by arbitrary relevant devices and (ii) verification of the structural similarities, including the naturalness of such synthetic prototypes, through various human perception experiments, computer evaluations and statistical hypothesis testing. A database consisting of a large number of online synthetic handwritten word samples is used to train and evaluate the performance of three existing automatic online handwriting recognition systems. Training based on a combined set of original and synthetic samples improves the recognition accuracies on the test set. A combined training set is useful irrespective of the nature of the feature set used (online, offline or combined). Although the proposed method has primarily been developed and applied to the design of an online handwriting sample database of a popular Indian script, Bangla, it can be applied to the generation of large databases of any arbitrary script for example: English, Chinese and Arabic.
机译:本文介绍了一种方法,可以从一小组原始的在线手写样本中生成一个大型的合成样本数据库。总体范例基于快速人类运动的运动学理论及其sigma-lognormal模型。本研究的主要贡献包括(i)开发一种策略,用于从对数相关设备捕获的任意脚本的真实在线手写样本中生成基于sigma-lognormal模型的合成样本,以及(ii)验证结构相似性,包括通过各种人类感知实验,计算机评估和统计假设检验,这种合成原型的自然性。由大量在线合成手写单词样本组成的数据库用于训练和评估三个现有的自动在线手写识别系统的性能。基于原始样本和合成样本的组合进行的训练可提高测试集的识别准确性。无论使用的功能集是什么性质(联机,脱机或组合),组合训练集都是有用的。尽管所提出的方法主要是开发并应用于流行的印度文字孟加拉语的在线手写样本数据库的设计,但它可以应用于生成任何任意文字的大型数据库,例如:英语,中文和阿拉伯语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号