This paper proposes a document algebra that supports both document transformation and pattern matching. The operational semantics of document transformation are defined on the basis of tree domain theory. Then, by treating a subtree structure of a DTD as the counterpart of an attribute in a relation schema, a set of operations on document sets is developed. These operations are equipped with pattern matching so that they can handle queries issued to a document database.