AUTOMATIC TRANSFORMATION OF COMPLEX TABLES IN DOCUMENTS INTO COMPUTER UNDERSTANDABLE STRUCTURED FORMAT WITH MAPPED DEPENDENCIES AND PROVIDING SCHEMA-LESS QUERY SUPPORT FOR SEARCHING TABLE DATA
An information processing system, a computer readable storage medium, and a computer-implemented method collect tables from a corpus of documents, convert the collected tables to a flattened table format, and organize them to be searchable by schema-less queries. The method collects tables and extracts feature values from the table data and table meta-data of each collected table. A table classifier classifies each collected table as a particular type of table. Based on that classification, the collected table is converted to a flattened table whose table values comprise the table data and the table meta-data of the collected table. Dependencies among the data values are mapped. The flattened table and the mapped dependencies are stored in a triple store searchable by schema-less queries. The table classifier learns and improves its accuracy and reliability. Dependency information is maintained among a plurality of database tables and can be updated at a variable update frequency.
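The flattening, dependency mapping, and triple storage described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the table has already been classified as a simple matrix with one row-header column and one column-header row, and it uses an in-memory list in place of a real triple store. The function names (`flatten_table`, `to_triples`, `match`, `lookup`) and the predicate names (`row_header`, `col_header`, `value`, `in_table`) are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def flatten_table(table_id: str, row_headers: List[str],
                  col_headers: List[str],
                  cells: List[List[str]]) -> List[Dict[str, str]]:
    """Flatten a matrix-style table into one record per data cell.

    Each record carries the headers (meta-data) the cell depends on,
    so the value no longer relies on its position in the grid.
    """
    records = []
    for r, row_header in enumerate(row_headers):
        for c, col_header in enumerate(col_headers):
            records.append({"table": table_id, "row_header": row_header,
                            "col_header": col_header, "value": cells[r][c]})
    return records


def to_triples(records: List[Dict[str, str]]) -> List[Triple]:
    """Express each flattened record as triples about one cell subject."""
    triples: List[Triple] = []
    for i, rec in enumerate(records):
        subject = f"{rec['table']}/cell{i}"  # hypothetical naming scheme
        triples.append((subject, "in_table", rec["table"]))
        for predicate in ("row_header", "col_header", "value"):
            triples.append((subject, predicate, rec[predicate]))
    return triples


def match(triples: List[Triple], s: Optional[str] = None,
          p: Optional[str] = None, o: Optional[str] = None) -> List[Triple]:
    """Return triples matching a pattern; None acts as a wildcard."""
    return [t for t in triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]


def lookup(triples: List[Triple], *header_terms: str) -> List[str]:
    """Find cell values by header terms alone.

    The caller does not say which term is a row header and which is a
    column header, which is the schema-less aspect of this sketch.
    """
    subjects = None
    for term in header_terms:
        hits = {s for s, p, _ in match(triples, o=term)
                if p in ("row_header", "col_header")}
        subjects = hits if subjects is None else subjects & hits
    return sorted(o for s in (subjects or ())
                  for _, _, o in match(triples, s=s, p="value"))


store = to_triples(flatten_table(
    "t1", ["2019", "2020"], ["Revenue", "Cost"],
    [["10", "7"], ["12", "8"]]))
print(lookup(store, "2020", "Revenue"))  # prints ['12']
```

A production system would presumably load such triples into an actual triple store and query it with a graph query language. The list-based `match` function stands in for that only to keep the sketch self-contained.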