Conference: International Conference on Document Analysis and Recognition

Crossing the lines: making optimal use of context in line-based Handwritten Text Recognition

Abstract

Handwritten text recognition (HTR) is often carried out line by line: each text line is decoded independently. This approach is known to degrade recognition accuracy for words and characters close to the line boundaries. The present study investigates this issue from the point of view of the language modeling component of the HTR system. Lack of linguistic context may be one reason for the loss of accuracy, but it is certainly not the only factor at play. We seek to clarify to what extent the problem can be influenced by the language modeling component of the system. We first discuss how to develop adapted language models which significantly improve HTR performance in general. We then focus on the deployment of methods to improve accuracy at line boundaries. The final result is an efficient approach which significantly improves HTR accuracy without changing the basic HTR system setup.
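
The abstract does not spell out the method, so the following is only a toy illustration of the underlying problem, not the paper's approach. It assumes a hypothetical add-one smoothed bigram language model and made-up optical scores, and shows how the first word of a line may be decoded differently depending on whether the language model sees a start marker or the last word of the previous line. All names (`train_bigram`, `log_prob`, `pick_first_word`), the corpus, and the scores are assumptions introduced for illustration.

```python
import math
from collections import Counter

START = "<s>"  # hypothetical start-of-line marker


def train_bigram(sentences):
    """Collect bigram counts, history counts, and the word vocabulary."""
    bigrams, histories, vocab = Counter(), Counter(), set()
    for tokens in sentences:
        padded = [START] + tokens
        bigrams.update(zip(padded, padded[1:]))
        histories.update(padded[:-1])  # every token that precedes another
        vocab.update(tokens)
    return bigrams, histories, vocab


def log_prob(word, prev, model):
    """Add-one smoothed log P(word | prev)."""
    bigrams, histories, vocab = model
    return math.log((bigrams[(prev, word)] + 1) / (histories[prev] + len(vocab)))


def pick_first_word(candidates, prev_line_last_word, model, use_context):
    """Choose the first word of a line from (word, optical_log_score) pairs.

    With use_context=False the line is decoded independently, so the
    language model conditions on START. With use_context=True it
    conditions on the last word of the previous line instead.
    """
    prev = prev_line_last_word if use_context else START
    return max(candidates, key=lambda c: c[1] + log_prob(c[0], prev, model))[0]


# Tiny made-up training corpus (assumption, not data from the paper).
corpus = [
    ["the", "white", "horse", "ran"],
    ["a", "white", "horse", "slept"],
    ["house", "prices", "rose"],
    ["house", "rules", "apply"],
]
model = train_bigram(corpus)

# The previous line ended in "white"; the optical model slightly prefers "house".
candidates = [("house", -1.0), ("horse", -1.2)]

print(pick_first_word(candidates, "white", model, use_context=False))  # house
print(pick_first_word(candidates, "white", model, use_context=True))   # horse
```

In this sketch the independent decoding picks "house" because that word often starts a sentence in the toy corpus, while carrying the previous line's last word into the language model history flips the choice to "horse". A real HTR system would integrate this into its decoder or rescoring step rather than into a single word choice.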
