Website presents data in various forms and formats, one of them in the form of a table. Tables on the Internet can be takenudsuch way by copy and paste, but this way is not easy if done on many tables then from extracted result they have beenudmerged with the other tables. This article discussed the research on extraction of HTML tables which stored into a databaseudform. The approach used was algorithm to perform the search process the number of rows and number of columns from theudtable, and algorithms to perform matching the contents of the table cell extraction results with a Property Name database, soudit is unknown whether the extracted table has property in the row/column/table without property. Table and Property Nameuddatabase displays the data in the Indonesian Language. At pre processing stage Property Name database which is alsoudprepared the techniques to enrich the instance of the Property Name database. The tables in the extract is a table HTMLudformat with a simple table where the form is not found of any merger of the rows and columns in the row position mergeud1/column 1. This research provides techniques to enrich the instance of a database, and with the use of illustrations, and thenudan approach to do the extraction of tabular HTML format can be done in a semi-automatic. In addition to that property in theudtable which is extracted can be distinguished from the contents of the cell which is a data table.
展开▼