Fast Words Boundaries Localization in Text Fields for Low Quality Document Images


Abstract

The paper examines the problem of precisely localizing word boundaries in document text zones. Document processing on a mobile device consists of document localization, perspective correction, localization of individual fields, finding words in separate zones, segmentation, and recognition. When an image is captured with a mobile digital camera under uncontrolled conditions, digital noise, perspective distortions, or glare may occur. Further processing is complicated by the specifics of the documents themselves: layout elements, complex backgrounds, static text, security elements, and a variety of text fonts. Nevertheless, word boundary localization has to be solved at runtime on a mobile CPU with limited computing capabilities under specified restrictions. At present, there are several groups of methods optimized for different conditions. Methods for scanned printed text are fast but limited to high-quality images. Methods for text in the wild have excessively high computational complexity and are therefore hardly suitable for running on mobile devices as part of a mobile document recognition system. The method presented in this paper solves a more specialized problem than finding text in natural images. It uses local features, a sliding window, and a lightweight neural network to achieve an optimal trade-off between speed and precision. The algorithm takes 12 ms per field on the ARM processor of a mobile device. The error rate for boundary localization on a test sample of 8000 fields is 0.3
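The abstract names three ingredients (local features, a sliding window, and a lightweight classifier) without giving details. The sketch below is only an illustration of how such a pipeline might be wired together, not the authors' implementation: the column ink fraction as the local feature, the function names, the window width, and the simple threshold scorer that stands in for the neural network are all assumptions made for this example.

```python
from typing import List, Optional, Tuple


def column_ink(field: List[List[int]]) -> List[float]:
    """Local feature (assumed): fraction of ink pixels (1 = ink) in each column."""
    height = len(field)
    width = len(field[0])
    return [sum(row[x] for row in field) / height for x in range(width)]


def gap_score(window: List[float]) -> float:
    """Hypothetical stand-in for a learned classifier: 1.0 if the window has no ink."""
    return 1.0 if max(window) == 0.0 else 0.0


def word_boundaries(field: List[List[int]], window: int = 3) -> List[Tuple[int, int]]:
    """Slide a window over the columns and return [start, end) column ranges of words."""
    ink = column_ink(field)
    width = len(ink)
    is_gap = [False] * width
    # Mark every column covered by a window that the scorer labels as a word gap.
    for start in range(width - window + 1):
        if gap_score(ink[start:start + window]) > 0.5:
            for x in range(start, start + window):
                is_gap[x] = True
    # Collect maximal runs of non-gap columns as words.
    words: List[Tuple[int, int]] = []
    word_start: Optional[int] = None
    for x in range(width):
        if not is_gap[x] and word_start is None:
            word_start = x
        elif is_gap[x] and word_start is not None:
            words.append((word_start, x))
            word_start = None
    if word_start is not None:
        words.append((word_start, width))
    return words


# Toy binarized field: two words separated by a three-column gap.
# The single blank column at index 2 is narrower than the window, so it is
# treated as an inter-character space rather than a word boundary.
toy_field = [
    [1, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]
print(word_boundaries(toy_field, window=3))  # [(0, 5), (8, 11)]
```

In a system like the one described, the threshold scorer would presumably be replaced by a small trained network applied to richer per-window features; the windowing and run-collection logic would stay broadly the same.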


Bibliographic details

Source
International Conference on Machine Vision | 2017 | 106960V.1-106960V.8 | 8 pages
Authors
Dmitry Ilin; Dmitriy Novikov; Dmitry Polevoy; Dmitry Nikolaev;
Original format: PDF
Keywords
Localization; detection; neural networks; image processing; computational efficiency;

Date added to database: 2022-08-26 13:48:19

Similar literature


1. Word Extraction and Character Segmentation from Text Lines of Unconstrained Handwritten Bangla Document Images [J] . Ram Sarkar, Samir Malakar, Nibaran Das, Journal of Intelligent Systems . 2011, No. 3
2. Text retrieval from document images based on word shape analysis [J] . Tan CL., Huang WH., Sung SY., Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2003, No. 3
3. Improved localization accuracy by LocNet for Faster R-CNN based text detection in natural scene images [J] . Zhong Zhuoyao, Sun Lei, Huo Qiang, Pattern Recognition: The Journal of the Pattern Recognition Society . 2019
4. Fast Words Boundaries Localization in Text Fields for Low Quality Document Images [C] . Dmitry Ilin, Dmitriy Novikov, Dmitry Polevoy, International Conference on Machine Vision . 2018

5. Markov random field model based text segmentation and image post processing of complex scanned documents [D] . Haneda, Eri 2011

6. Text Detection in Natural Scene Images by Stroke Gabor Words [O] . Chucai Yi, Yingli Tian
7. British Letters Patent of 1908 and 1917 constituting the Falkland Islands Dependencies. The following are the texts of the two Letters Patent defining the boundaries of the Falkland Islands Dependencies. They are reprinted here in view of the current political interest in this area. Some confusion has arisen owing to misrepresentation of the wording of these documents. The Letters Patent of 1908 made provision for the government of certain specified land areas lying between specified latitudes and longitudes. No claim was made to jurisdiction over the High Seas within these boundaries; still less was any claim made to that part of South America which lies to the south of latitude 50° S. The Letters Patent of 1917 defined the area more precisely in order to avoid this ambiguity. All subsequent British legislation for the administration of these Dependencies is based on the authority of these two documents. [O] . 1948
8. Seeing and Reading Red: Hue and Color-word Correlation in Images and Attendant Text on the WWW [R] . Newsam, S. 2004

