首页>
外国专利>
METHOD FOR PARSING TABLE DATA IN PDF FILE
METHOD FOR PARSING TABLE DATA IN PDF FILE
展开▼
机译:PDF文件中的表格数据解析方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a method for parsing table data targeting a PDF file. The present invention extracts data from the PDF file and analyzes the file structure to generate a parse tree for the PDF file, and using the generated parse tree to search for a location of a page containing a headword of a table to be searched. , Based on the coordinates (x, y) assigned to the headword of the table to be searched, including the step of setting a parsing range within the searched page and parsing the table data targeting the set parsing range. It is characterized. According to the present invention, there is an advantage that target table data can be accurately parsed from a PDF file.
展开▼