首页>
外国专利>
Apparatus and Method for Recognizing Image-Based Content Presented in a Structured Layout
Apparatus and Method for Recognizing Image-Based Content Presented in a Structured Layout
展开▼
机译:用于识别以结构化布局呈现的基于图像的内容的装置和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method for extracting information from a table includes steps as follows. Characters of a table are extracted. The characters are merged into n-gram characters. The n-gram characters are merged into words and text lines through a two-stage GNN mode. The two-stage GNN mode comprises sub steps as: spatial features, semantic features, CNN image features are extracted from a target source; a first GNN stage is processed to output graph embedding spatial features from the spatial features; and a second GNN stage is processed to output graph embedding semantic features and graph embedding CNN image features from the semantic features and the CNN image features, respectively. The text lines are merged into cells. The cells are grouped into rows, columns, and key based on one or more adjacency matrices, a row relationship among the cells, a column relationship among the cells, and a key-value relationship among the cells.
展开▼