【24h】

Handwriting Style Mixture Adaptation

机译:手写风格混合适应

获取原文

摘要

In handwriting recognition, the test data usually come from multiple writers which are not shown in the training data. Therefore, adapting the base classifier towards the new style of each writer can significantly improve the generalization performance. Traditional writer adaptation methods usually assume that there is only one writer (one style) in the test data, and we call this situation as style-clear adaptation. However, a more common situation is that multiple handwriting styles exist in the test data, which is widely appeared in multi-font documents and handwriting data produced by the cooperation of multiple writers. We call the adaptation in this situation as style-mixture adaptation. To deal with this problem, in this paper, we propose a novel method called K-style mixture adaptation (K-SMA) with the assumption that there are totally K styles in the test data. Specifically, we first partition the test data into K groups (style clustering) according to their style consistency, which is measured by a newly designed style feature that can eliminate class (category) information and keep handwriting style information. After that, in each group, a style transfer mapping (STM) is used for writer adaptation. Since the initial style clustering may be not reliable, we repeat this process iteratively to improve the adaptation performance. The K-SMA model is fully unsupervised which do not require either the class label or the style index. Moreover, the K-SMA model can be effectively combined with the benchmark convolutional neural network (CNN) models. Experiments on the online Chinese handwriting database CASIA-OLHWDB demonstrate that K-SMA is an efficient and effective solution for style-mixture adaptation.
机译:在手写识别中,测试数据通常来自训练数据中未显示的多个作家。因此,将基础分类器适应每个作家的新风格可以显着提高泛化性能。传统的作家适应方法通常假设测试数据中只有一个作家(一种样式),并且我们称这种情况称为风格清晰的适应。然而,更常见的情况是,在测试数据中存在多种手写样式,其广泛出现在多字体文档和由多个编写者协作产生的手写数据中。我们称这种情况称为风格混合适应。为了解决这个问题,在本文中,我们提出了一种新的方法,该方法是假设测试数据中存在完全k样式的假设,提出了一种称为K样式混合适应(K-SMA)的方法。具体地,我们首先将测试数据分区为K组(样式群集)根据其型号的一致性,它通过新设计的风格功能来衡量,可以消除类(类别)信息并保持手写样式信息。之后,在每个组中,样式传输映射(STM)用于编写器适应。由于初始样式群集可能不可靠,因此我们迭代地重复此过程以提高适应性性能。 K-SMA模型完全无监督,不需要类标签或样式索引。此外,K-SMA模型可以与基准卷积神经网络(CNN)模型有效地结合。在线汉语手写数据库Casia-OLHWDB的实验表明K-SMA是适用于款式混合适应的高效有效的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号