The purpose of document structure analysis is to recover the structure of a source text. This paper defines document structure in terms of three layers and proposes a new model for document structure analysis, DLM. The model consists of three layers (a physical structure layer, a logical structure layer, and a semantic structure layer) that correspond to the three layers of the definition. The input, output, and operations of each layer are described in detail. The model is flexible, systematic, and extensible. DLM has been implemented in the Automatic Summarization System, and the implementation shows that the model is feasible and can produce good results.
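As a rough illustration of how a three-layer analysis could be chained, the sketch below passes the output of each layer to the next. It is not the paper's implementation: the abstract does not specify the operations of each layer, so the function names (`physical_layer`, `logical_layer`, `semantic_layer`, `analyze`), the data classes, and the heuristics (blank-line splitting, short lines as headings, frequent words as keywords) are all assumptions made for the example.

```python
import re
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Paragraph:
    text: str
    is_heading: bool = False  # set by the logical layer


@dataclass
class Section:
    title: str
    paragraphs: list = field(default_factory=list)
    keywords: list = field(default_factory=list)  # set by the semantic layer


def physical_layer(raw_text):
    """Hypothetical physical layer: raw text in, blank-line-separated blocks out."""
    blocks = re.split(r"\n\s*\n", raw_text.strip())
    return [Paragraph(" ".join(b.split())) for b in blocks if b.strip()]


def logical_layer(paragraphs):
    """Hypothetical logical layer: blocks in, titled sections out.

    Toy heuristic (an assumption): a block of at most six words that
    does not end with a period is treated as a heading.
    """
    sections = [Section(title="")]
    for p in paragraphs:
        p.is_heading = len(p.text.split()) <= 6 and not p.text.endswith(".")
        if p.is_heading:
            sections.append(Section(title=p.text))
        else:
            sections[-1].paragraphs.append(p)
    # Drop the initial placeholder section if nothing preceded the first heading.
    return [s for s in sections if s.title or s.paragraphs]


def semantic_layer(sections, top_n=3):
    """Hypothetical semantic layer: sections in, sections with keywords out.

    Frequent words of five or more letters stand in for real semantic analysis.
    """
    for s in sections:
        body = " ".join(p.text for p in s.paragraphs).lower()
        words = re.findall(r"[a-z]{5,}", body)
        s.keywords = [w for w, _ in Counter(words).most_common(top_n)]
    return sections


def analyze(raw_text):
    """Run the three layers in order, each consuming the previous layer's output."""
    return semantic_layer(logical_layer(physical_layer(raw_text)))
```

Calling `analyze` on a text whose headings and paragraphs are separated by blank lines returns a list of `Section` objects. Because each layer only depends on the output type of the layer before it, any one of them could be replaced without touching the others, which is one way to read the flexibility and extensibility claimed for the model.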