首页> 外国专利> Selectively displaying OCR-recognized text from the publication and the corresponding image on the client device.

Selectively displaying OCR-recognized text from the publication and the corresponding image on the client device.

机译:在客户端设备上选择性地显示出版物中的OCR识别文本和相应的图像。

摘要

Text is extracted from a source image of a publication using an Optical Character Recognition (OCR) process. A document is generated containing text segments of the extracted text. The document includes a control module that responds to user interactions with the displayed document. Responsive to a user selection of a displayed text segment, a corresponding image segment from the source image containing the text is retrieved and rendered in place of the selected text segment. The user can select again to toggle the display back to the text segment. Each text segment can be tagged with a garbage score indicating its quality. If the garbage score of a text segment exceeds a threshold value, the corresponding image segment can be automatically displayed instead.
机译:使用光学字符识别(OCR)过程从出版物的源图像中提取文本。生成包含提取的文本的文本段的文档。该文档包括控制模块,该模块响应用户与显示的文档的交互。响应于用户对所显示的文本段的选择,从源图像中包含文本的对应图像段被检索并被渲染以代替所选择的文本段。用户可以再次选择将显示切换回文本段。每个文本段都可以用表明其质量的垃圾得分标记。如果文本段的垃圾得分超过阈值,则可以自动显示相应的图像段。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号