首页> 外国专利> A METHOD OF IMPROVING THE RATIO OF OPTICAL CONTENT RECOGNITION IN FORM-BASED PHYSICAL DOCUMENT DIGITALIZATION USING A MULTI-SPECTRAL ANALYSIS APPROACH

A METHOD OF IMPROVING THE RATIO OF OPTICAL CONTENT RECOGNITION IN FORM-BASED PHYSICAL DOCUMENT DIGITALIZATION USING A MULTI-SPECTRAL ANALYSIS APPROACH

机译:利用多光谱分析方法提高基于形式的物理文档数字化过程中光学内容识别率的方法

摘要

The proposed device (Multispectral Scanner) is used to divide the scanned document into layers. Each of these corresponds to a different medium, eg. Paper, ink printout, laser printout, dirt etc. The device utilizes the properties of light waves such as reflectance, absorption, etc. This makes it possible to identify the pigments present in the output. Available now on the market document scanners are not able to automatically separate the content for OCR (content suitable for optical character recognition) from noise and dirt. It is worth noting that it is important not only to use the proposed construction of a scanner but also the proposed algorithms which are analyzing the recorded data.
机译:建议的设备(多光谱扫描仪)用于将扫描的文档分为几层。这些中的每一个对应于不同的介质,例如。纸张,墨水打印输出,激光打印输出,脏物等。该设备利用了光波的特性,例如反射率,吸收率等。这使得可以识别输出中存在的颜料。市场上现在可用的文档扫描仪无法自动将OCR的内容(适合光学字符识别的内容)与噪音和污垢分开。值得注意的是,不仅重要的是使用建议的扫描仪结构,而且还建议使用正在分析记录数据的算法。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号