首页>
外国专利>
Selectively displaying OCR-recognized text from the publication and the corresponding image on the client device.
Selectively displaying OCR-recognized text from the publication and the corresponding image on the client device.
展开▼
机译:在客户端设备上选择性地显示出版物中的OCR识别文本和相应的图像。
展开▼
页面导航
摘要
著录项
相似文献
摘要
Text is extracted from a source image of a publication using an Optical Character Recognition (OCR) process. A document is generated containing text segments of the extracted text. The document includes a control module that responds to user interactions with the displayed document. Responsive to a user selection of a displayed text segment, a corresponding image segment from the source image containing the text is retrieved and rendered in place of the selected text segment. The user can select again to toggle the display back to the text segment. Each text segment can be tagged with a garbage score indicating its quality. If the garbage score of a text segment exceeds a threshold value, the corresponding image segment can be automatically displayed instead.
展开▼