首页> 外文会议>IEEE International Conference on Electronic Measurement and Instruments >Automatic classification and recognition of complex documents based on Faster RCNN
【24h】

Automatic classification and recognition of complex documents based on Faster RCNN

机译:基于Faster RCNN的复杂文档自动分类和识别

获取原文

摘要

OCR(Optical Character Recognition) has been widely used in digital document processing, but the current OCR technology only works well in simple document processing. In contrast, complex documents contain a lot of non-text information (icons, forms, signatures, seals, noise, etc.). Most OCR systems often misinterpret these non-textual information as text when dealing with complex documents, resulting in the wrong target recognition. Automatic document segmentation and recognition technology not only improves the processing accuracy of OCR, but also enhances the processing of documents. Therefore, this project will study how to use Faster RCNN (fast regional convolution) technology to automatically classify and identify text, icon, table, noise and other objects in complex documents. Faster RCNN is one of the mainstream frameworks in the field of target detection. Although it has been applied to text area recognition, it has not been reported in the division and recognition of complex documents. This project specifically studies the complex document image preprocessing technology, RPN (Region Proposal Network) technology and target recognition technology involved in the Faster RCNN technology. It enables fast and accurate separation of seal area, text area and page number area from complex documents. It provides accurate target area for the next step of OCR recognition and document enhancement, and realizes accurate recognition and enhancement.
机译:OCR(光学字符识别)已广泛用于数字文档处理中,但是当前的OCR技术仅在简单的文档处理中有效。相反,复杂的文档包含许多非文本信息(图标,表格,签名,印章,杂音等)。大多数OCR系统在处理复杂文档时常常会将这些非文本信息误解为文本,从而导致错误的目标识别。自动文档分割和识别技术不仅提高了OCR的处理精度,而且还增强了文档的处理能力。因此,该项目将研究如何使用Faster RCNN(快速区域卷积)技术自动分类和识别复杂文档中的文本,图标,表格,噪声和其他对象。更快的RCNN是目标检测领域的主流框架之一。尽管已将其应用于文本区域识别,但尚未在复杂文档的划分和识别中进行报告。该项目专门研究Faster RCNN技术中涉及的复杂文档图像预处理技术,RPN(区域提议网络)技术和目标识别技术。它可以快速,准确地将印章区域,文本区域和页码区域与复杂文档区分开。它为下一步的OCR识别和文档增强提供了准确的目标区域,并实现了准确的识别和增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号