OCR enhancement through neighbor embedding and fast approximate nearest neighbors

Abstract

Generic optical character recognition (OCR) engines often perform very poorly in transcribing scanned low resolution (LR) text documents. To improve OCR performance, we apply the Neighbor Embedding (NE) single-image super-resolution (SISR) technique to LR scanned text documents to obtain high resolution (HR) versions, which we subsequently process with OCR. For comparison, we repeat this procedure using bicubic interpolation (BI). We demonstrate that mean-square errors (MSE) in NE HR estimates do not increase substantially when NE is trained in one Latin font style and tested in another, provided both styles belong to the same font category (serif or sans serif). This is very important in practice, since for each font size, the number of training sets required for each category may be reduced from dozens to just one. We also incorporate randomized k-d trees into our NE implementation to perform approximate nearest neighbor search, and obtain a 1000x speed up of our original NE implementation, with negligible MSE degradation. This acceleration also made it practical to combine all of our size-specific NE Latin models into a single Universal Latin Model (ULM). The ULM eliminates the need to determine the unknown font category and size of an input LR text document and match it to an appropriate model, a very challenging task, since the dpi (pixels per inch) of the input LR image is generally unknown. Our experiments show that OCR character error rates (CER) were over 90% when we applied the Tesseract OCR engine to LR text documents (scanned at 75 dpi and 100 dpi) in the 6-10 pt range. By contrast, using k-d trees and the ULM, CER after NE preprocessing averaged less than 7% at 3x (100 dpi LR scanning) and 4x (75 dpi LR scanning) magnification, over an order of magnitude improvement. Moreover, CER after NE preprocessing was more than 6 times lower on average than after BI preprocessing.

© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).
Downloading of the abstract is permitted for personal use only.
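
The abstract reports its results as character error rates (CER) but does not spell out how CER is computed. The sketch below assumes the common definition: the Levenshtein (edit) distance between the OCR output and the ground-truth text, divided by the length of the ground truth. The function names `edit_distance` and `character_error_rate` are illustrative and do not come from the paper, and the author's exact scoring procedure may differ.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: fewest single-character insertions,
    deletions, and substitutions needed to turn hyp into ref."""
    # prev[j] holds the distance between the first i-1 chars of ref
    # and the first j chars of hyp; only two rows are kept in memory.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # character of ref missing from hyp
                curr[j - 1] + 1,     # extra character in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """CER under the assumed definition: edit distance / reference length."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    truth = "nearest neighbor"
    ocr_output = "nearest neighbcr"  # one substituted character
    print(f"CER = {character_error_rate(truth, ocr_output):.4f}")
```

Under this definition CER can exceed 100% when the OCR output contains many spurious characters, so the reported "over 90%" for unprocessed LR scans corresponds to output that is almost entirely wrong.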

Bibliographic details

Source: Applications of digital image processing XXXV, 2012, pp. 1I.1-1I.12 (12 pages)
Conference location: San Diego, CA (US)
Author: D. C. Smith
Affiliation: U.S. Dept. of Defense (United States)
Original format: PDF
Language: English
Chinese Library Classification: Information processing
Indexed: 2022-08-26 14:30:51

