A method of document structure extraction using generic layout knowledge is described. With this method, it is possible to translate images of multimedia documents, i.e. documents that include pictures, graphics, and color information, to hypertext. Hypertext consists of decomposed elements linked with each other through some logical relationship. The principal components of the method are extraction of logical structure elements using a rectangular set operation and generation of hierarchical links of the logical structure between the extracted document elements. It is shown experimentally that the logical structure of a technical paper can be extracted.
展开▼