首页> 外文会议>International conference on machine vision >Training a Whole-Book LSTM-Based Recognizer with an Optimal Training Set
【24h】

Training a Whole-Book LSTM-Based Recognizer with an Optimal Training Set

机译:使用最佳训练集训练基于LSTM的全书型识别器

获取原文

摘要

Despite the recent progress in OCR technologies, whole-book recognition, is still a challenging task, in particular in case of old and historical books, that the unknown font faces or low quality of paper and print contributes to the challenge. Therefore, pre-trained recognizers and generic methods do not usually perform up to required standards, and usually the performance degrades for larger scale recognition tasks, such as of a book. Such reportedly low error-rate methods turn out to require a great deal of manual correction. Generally, such methodologies do not make effective use of concepts such redundancy in whole-book recognition. In this work, we propose to train Long Short Term Memory (LSTM) networks on a minimal training set obtained from the book to be recognized. We show that clustering all the sub-words in the book, and using the sub-word cluster centers as the training set for the LSTM network, we can train models that outperform any identical network that is trained with randomly selected pages of the book. In our experiments, we also show that although the sub-word cluster centers are equivalent to about 8 pages of text for a 101-page book, a LSTM network trained on such a set performs competitively compared to an identical network that is trained on a set of 60 randomly selected pages of the book.
机译:尽管OCR技术最近取得了进步,但整本书的识别仍然是一项艰巨的任务,特别是在旧书和历史书籍的情况下,未知的字体或较低质量的纸张和印刷品是造成挑战的原因。因此,经过预训练的识别器和通用方法通常无法达到要求的标准,并且对于诸如书本之类的较大规模的识别任务,性能通常会下降。据报道,这种错误率低的方法需要大量的手动校正。通常,这种方法在整本书识别中没有有效利用诸如冗余之类的概念。在这项工作中,我们建议在从要识别的书中获得的最小训练集上训练长期短期记忆(LSTM)网络。我们显示出将书中的所有子词聚类,并使用子词聚类中心作为LSTM网络的训练集,我们可以训练出优于任何使用书中随机选择的页面训练的相同网络的模型。在我们的实验中,我们还表明,尽管对于101页的书来说,子词聚类中心相当于大约8页文本,但是与在相同的网络上训练的LSTM网络相比,在这样的集合上训练的LSTM网络具有竞争优势。这套书的60个随机选择的页面集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号