首页> 外文会议>International Conference on Parallel and Distributed Processing Techniques and Applications >An Effective and Interactive Training Data Collection Method for Early-Modern Japanese Printed Character Recognition
【24h】

An Effective and Interactive Training Data Collection Method for Early-Modern Japanese Printed Character Recognition

机译:早期现代日本印刷字符识别的有效互动培训数据收集方法

获取原文

摘要

In this paper, we present a web application that supports to collect training data efficiently for early-modern Japanese printed character recognition. The national diet library in Japan provides a lot of early-modern (AD1868-1945) Japanese printed books to the public, but full-text search is essentially impossible. In order to perform advanced search in historical literatures, it is required extracting texts from images. To solve this problem, we have already proposed a multi-font Kanji character recognition method using the PDC feature and an SVM. For growing inperformance of this method, we need big amounts of training data. However, collecting training data by hand is extremely inefficient. Therefore, we propose a Web application that supports collecting training data.
机译:在本文中,我们提出了一个支持早期现代日本印刷字符识别有效地收集培训数据的Web应用程序。日本的国家饮食图书馆提供了许多早期现代(AD1868-1945)日本印刷书籍向公众提供,但全文搜索本质上是不可能的。为了在历史文献中执行高级搜索,需要从图像中提取文本。为了解决这个问题,我们已经提出了一种使用PDC功能和SVM的多字体Kanji字符识别方法。为了不断增长这种方法的表现,我们需要大量的培训数据。但是,手工收集培训数据非常低效。因此,我们提出了一个支持收集培训数据的Web应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号