首页> 外文会议>International conference on machine vision >Fast Words Boundaries Localization in Text Fields for Low Quality Document Images
【24h】

Fast Words Boundaries Localization in Text Fields for Low Quality Document Images

机译:低质量文档图像的文本字段中的快速单词边界本地化

获取原文

摘要

The paper examines the problem of word boundaries precise localization in document text zones. Document processing on a mobile device consists of document localization, perspective correction, localization of individual fields, finding words in separate zones, segmentation and recognition. While capturing an image with a mobile digital camera under uncontrolled capturing conditions, digital noise, perspective distortions or glares may occur. Further document processing gets complicated because of its specifics: layout elements, complex background, static text, document security elements, variety of text fonts. However, the problem of word boundaries localization has to be solved at runtime on mobile CPU with limited computing capabilities under specified restrictions. At the moment, there are several groups of methods optimized for different conditions. Methods for the scanned printed text are quick but limited only for images of high quality. Methods for text in the wild have an excessively high computational complexity, thus, are hardly suitable for running on mobile devices as part of the mobile document recognition system. The method presented in this paper solves a more specialized problem than the task of finding text on natural images. It uses local features, a sliding window and a lightweight neural network in order to achieve an optimal algorithm speed-precision ratio. The duration of the algorithm is 12 ms per field running on an ARM processor of a mobile device. The error rate for boundaries localization on a test sample of 8000 fields is 0.3
机译:本文研究了文档文本区域中单词边界精确定位的问题。移动设备上的文档处理包括文档定位,透视校正,各个字段的定位,在单独区域中查找单词,分割和识别。在不受控制的拍摄条件下用移动数码相机拍摄图像时,可能会发生数字噪音,透视失真或眩光。由于其特殊性,进一步的文档处理变得复杂:布局元素,复杂的背景,静态文本,文档安全性元素,各种文本字体。但是,字边界本地化的问题必须在运行时在具有指定限制的有限计算能力的移动CPU上解决。目前,有几组针对不同条件进行了优化的方法。扫描打印文本的方法快速,但仅适用于高质量图像。野外文本方法的计算复杂度过高,因此,几乎不适合作为移动文档识别系统的一部分在移动设备上运行。本文提出的方法比在自然图像上查找文本的任务解决了更专业的问题。它使用局部特征,滑动窗口和轻量级神经网络,以实现最佳算法速度精度比。该算法的持续时间是在移动设备的ARM处理器上运行的每个字段12 ms。在8000个场的测试样本上边界定位的错误率是0.3

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号