首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Exploiting Existing Modern Transcripts for Historical Handwritten Text Recognition
【24h】

Exploiting Existing Modern Transcripts for Historical Handwritten Text Recognition

机译:用于历史手写识别现有现代成绩单

获取原文

摘要

Existing transcripts for historic manuscripts are a very valuable resource for training models useful for automatic recognition, aided transcription, and/or indexing of the remaining untranscribed parts of these collections. However, these existing transcripts generally exhibit two main problems which hinder their convenience: a) text of the transcripts is seldom aligned with manuscript lines, and b) text often deviate very significantly from what can be seen in the manuscript, either because writing style has been modernized or abbreviations have been expanded, or both. This work presents an analysis of these problems and discusses possible solutions for minimizing human effort needed to adapt existing transcripts in order to render them usable. Empirical results presented show the huge performance gain that can be obtained by adequately adapting the transcripts, thus motivating future development of the proposed solutions.
机译:历史稿件的现有成绩单是用于培训模型的非常有价值的资源,可用于自动识别,辅助转录和/或索引这些集合的剩余未经筛选部分的索引。然而,这些现有的成绩单通常表现出阻碍其便利性的两个主要问题:a)成绩单的文本很少与手稿线对齐,b)文本通常非常偏离手稿中可以在手稿中可以看到的,无论是因为写作风格已经展开了现代化或缩写,或两者都是扩大的。这项工作提出了对这些问题的分析,并讨论了最大限度地减少适应现有成绩单所需的人力努力的可能解决方案,以使其可用。提出了经验结果,展示了通过充分适应成绩单可以获得的巨大性能增益,从而激励所提出的解决方案的未来发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号