TABLE DATA PARSING METHOD FOR PDF FILE

Abstract

The present invention relates to a method for parsing table data from a PDF file. The method comprises the steps of: generating a parse tree for the PDF file by extracting data from the file and analyzing its structure; using the generated parse tree to locate the page that contains the headword of the target table; setting a parsing range on the located page relative to the coordinates (x, y) assigned to that headword; and parsing the table data within the set range. The invention thereby enables target table data to be parsed accurately from a PDF file.
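The last three steps of the abstract (locate the headword's page, set a range relative to the headword's coordinates, parse inside that range) can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the parse tree has already been flattened into a list of words with page numbers and coordinates, that y increases downward, and that the range is a fixed-size rectangle starting at the headword. The names `Word`, `find_headword`, `parse_table`, `width`, `height`, and `row_tol` are hypothetical and do not appear in the patent.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Word:
    """One extracted text fragment (hypothetical flattened parse-tree leaf)."""
    page: int
    x: float  # left edge
    y: float  # top edge; assumed to increase downward
    text: str


def find_headword(words: List[Word], headword: str) -> Optional[Word]:
    """Return the first word whose text equals the headword, if any."""
    for w in words:
        if w.text == headword:
            return w
    return None


def parse_table(words: List[Word], headword: str,
                width: float, height: float,
                row_tol: float = 2.0) -> List[List[str]]:
    """Collect words below the headword inside a width x height rectangle
    and group them into rows (by y) and columns (by x)."""
    anchor = find_headword(words, headword)
    if anchor is None:
        return []

    # Parsing range, set relative to the headword's (x, y) coordinates.
    x0, y0 = anchor.x, anchor.y
    x1, y1 = x0 + width, y0 + height

    in_range = [
        w for w in words
        if w.page == anchor.page and x0 <= w.x <= x1 and y0 < w.y <= y1
    ]
    in_range.sort(key=lambda w: w.y)

    # Group words into rows: a word joins the current row while its y is
    # within row_tol of the row's first word.
    rows: List[List[Word]] = []
    for w in in_range:
        if rows and abs(w.y - rows[-1][0].y) <= row_tol:
            rows[-1].append(w)
        else:
            rows.append([w])

    # Order the cells of each row from left to right.
    return [[w.text for w in sorted(row, key=lambda w: w.x)] for row in rows]
```

Calling `parse_table(words, "Revenue", width=200, height=100)` would return the rows found within 100 units below the word "Revenue" on its page. A real implementation would derive the range from the table's actual extent rather than from fixed dimensions.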

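Bibliographic details

Publication number: WO2021145541A1
Patent type:
Publication date: 2021-07-22
Original format: PDF
Applicant/Assignee: TITECHNOLOGY CO. LTD.
Application number: WO2020KR15235
Inventors: GU DA HAE; KIM DONG HOON
Filing date: 2020-11-03
Classification: G06F16/22
Country: KR
Database entry time: 2022-08-24 20:09:10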