首页> 外国专利> DETECTING SECTIONS OF TABLES IN DOCUMENTS BY NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT

DETECTING SECTIONS OF TABLES IN DOCUMENTS BY NEURAL NETWORKS USING GLOBAL DOCUMENT CONTEXT

机译:使用全局文档上下文通过神经网络检测文档中的表格部分

摘要

FIELD: data processing.;SUBSTANCE: invention relates to detection of text fields in documents. Data processing method includes obtaining a plurality of document symbol sequences comprising at least one table; determining a plurality of vectors representing sequences of symbols comprising at least one alphanumeric character or graphic element of a table; processing a plurality of vectors using a first neural network to obtain a plurality of recalculated vectors; determining a link between a first scaled vector and a second recalculated vector, where the first scaled vector belongs to the alphanumeric sequence, and the second scaled vector is associated with a table section, as well as determining communication between alphanumeric sequence and table section based on communication between first recalculated vector and second recalculated vector.;EFFECT: technical result is wider range of means.;20 cl, 9 dwg
机译:技术领域本发明涉及文档中文本字段的检测。数据处理方法包括:获得包括至少一个表的多个文档符号序列;以及确定表示代表符号序列的多个矢量,这些符号包括表格的至少一个字母数字字符或图形元素;使用第一神经网络处理多个向量以获得多个重新计算的向量;确定第一缩放向量和第二重新计算向量之间的链接,其中第一缩放向量属于字母数字序列,第二缩放向量与表部分相关联,并基于以下信息确定字母数字序列和表部分之间的通信第一个重新计算的向量与第二个重新计算的向量之间的通信。效果:技术结果是更广泛的手段。20cl,9 dwg

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号