首页> 外文会议>International Workshop on Document Analysis Systems >The Impact of Large Training Sets on the Recognition Rate of Off-line Japanese Kanji Character Classifiers
【24h】

The Impact of Large Training Sets on the Recognition Rate of Off-line Japanese Kanji Character Classifiers

机译:大型培训集对离线识别日语汉字字符分类器的影响

获取原文

摘要

Though it is commonly agreed that increasing the training set size leads to improved recognition rates, the deficit of publicly available Japanese character pattern databases prevents us from verifying this assumption empirically for large data sets. Whereas the typical number of training samples has usually been between 100-200 patterns per category until now, newly collected databases and increased computing power allows us to experiment with a much higher number of samples per category. In this paper, we experiment with off-line classifiers trained with up to 1550 patterns for 3036 categories respectively. We show that this bigger training set size indeed leads to improved recognition rates compared to the smaller training sets normally used.
机译:虽然通常同意增加培训集大小导致改善的识别率,但是公开的日本字符模式数据库的缺陷可防止我们验证大型数据集的主题验证此假设。然而,典型的训练样本通常在每类的100-200模式之间,直到现在,新收集的数据库和增加的计算能力允许我们每类别进行更高数量的样本。在本文中,我们分别在培训的离线分类器进行培训,分别为3036类高达1550个模式。我们表明,与通常使用的较小训练集相比,这种更大的训练集规则确实导致识别率提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号