首页>
外国专利>
JOINING WEB DATA WITH SPREADSHEET DATA USING EXAMPLES
JOINING WEB DATA WITH SPREADSHEET DATA USING EXAMPLES
展开▼
机译:使用示例将Web数据与电子表格数据结合起来
展开▼
页面导航
摘要
著录项
相似文献
摘要
Provided are methods and systems for joining semi-structured data from the web with relational data in a spreadsheet table using input-output examples. A first sub-task performed by the system learns a string transformation program to transform input rows of a table to URL strings that correspond to the webpages where the relevant data is present. A second sub-task learns a program in a rich web data extraction language to extract desired data from the webpage given the example extractions. Hierarchical search and input-driven ranking are used to efficiently learn the programs using few input-output examples. The learnt programs are then run on the remaining spreadsheet entries to join desired data from the corresponding web pages.
展开▼