首页> 外国专利> Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data

Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data

机译:使用神经网络图像分析和具有增强的模板和上下文数据的四元组机器学习进行3通道动态上下文脚本识别的设备,方法和系统

摘要

In some embodiments, a method includes training a first machine learning model based on multiple documents and multiple templates associated with the multiple documents. The method further includes executing the first machine learning model to generate multiple relevancy masks, the multiple relevancy masks to remove a visual structure of the multiple templates from a visual structure of the multiple documents. The method further includes generating multiple multichannel field images to include the multiple relevancy masks and at least one of the multiple documents or the multiple templates. The method further includes training a second machine learning model based on the multiple multichannel field images and multiple non-native texts associated with the multiple documents. The method further includes executing the second machine learning model to generate multiple non-native texts from the multiple multichannel field images.
机译:在一些实施例中,一种方法包括基于多个文档和与多个文档相关联的多个模板来训练第一机器学习模型。该方法还包括执行第一机器学习模型以生成多个相关掩码,该多个相关掩码以从多个文档的视觉结构中移除多个模板的视觉结构。该方法还包括生成多个多通道场图像,以包括多个相关性掩码以及多个文档或多个模板中的至少一个。该方法还包括基于多个多通道场图像和与多个文档相关联的多个非本机文本来训练第二机器学习模型。该方法还包括执行第二机器学习模型以从多个多通道场图像生成多个非本地文本。

著录项

  • 公开/公告号US10671892B1

    专利类型

  • 公开/公告日2020-06-02

    原文格式PDF

  • 申请/专利权人 HYPER LABS INC.;

    申请/专利号US201916674324

  • 申请日2019-11-05

  • 分类号G06K9/62;G06N3/04;G06K9/46;G06N3/08;

  • 国家 US

  • 入库时间 2022-08-21 11:28:14

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号