首页> 外国专利> Improving optical character recognition (OCR) accuracy by combining results across video frames

Improving optical character recognition (OCR) accuracy by combining results across video frames

机译:通过合并视频帧之间的结果来提高光学字符识别(OCR)的准确性

摘要

The present disclosure relates to optical character recognition using captured video. According to one embodiment, using a first image in stream of images depicting a document, the device extracts text data in a portion of the document depicted in the first image and determines a first confidence level regarding an accuracy of the extracted text data. If the first confidence level satisfies a threshold value, the device saves the extracted text data as recognized content of the source document. Otherwise, the device extracts the text data from the portion of the document as depicted in one or more second images in the stream and determines a second confidence level for the text data extracted from each second image until identifying one of the second images where the second confidence level associated with the text data extracted from the identified second image satisfies the threshold value.
机译:本公开涉及使用捕获的视频的光学字符识别。根据一个实施例,使用描绘文档的图像流中的第一图像,设备提取第一图像中描绘的文档的一部分中的文本数据,并确定关于所提取的文本数据的准确性的第一置信度。如果第一置信度满足阈值,则设备将提取的文本数据保存为源文档的识别内容。否则,设备从流中的一个或多个第二图像中描绘的文档部分中提取文本数据,并确定从每个第二图像中提取的文本数据的第二置信度,直到识别出第二个图像中的一个与从识别出的第二图像中提取的文本数据相关联的置信度满足阈值。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号