首页>
外国专利>
METHOD AND SYSTEM FOR EXTRACTING DATA FROM IMAGES OF SEMISTRUCTURED DOCUMENTS
METHOD AND SYSTEM FOR EXTRACTING DATA FROM IMAGES OF SEMISTRUCTURED DOCUMENTS
展开▼
机译:从半结构化文档的图像中提取数据的方法和系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
FIELD: physics.;SUBSTANCE: text representation of the document image is obtained in the process of extracting data from the fields to the document image. A graph is constructed to store attributes of the document text fragments and the links between them. A cascade classification is made to calculate the attributes of the document text fragments and the links between them. A set of hypotheses is formed about the text fragment affiliation in the fields on the document image. A combination of hypotheses is selected. And data extracting is done from the fields on the document image based on the selected combination of the hypotheses.;EFFECT: saving computing resources.;15 cl, 8 dwg
展开▼