Semi-structured data has become prevalent with the gorwth of the Internet and other on-line information repositories. Many organizational databases are presented on the web as semi-structured data. Designing a "good" semi-structured database is increasingly crucial to prevent data redundancy, incosistency and updating anomalies. In this paper, we define a semi-structured schema graph and identify the varuous anomalies that may occur in the graph. a normal form for semistructured schema graph, S3-NF, is proposed. We present two approaches to design S3-NF database, namely, restructuring by decomposition and the ER approach. The first approach consists of a set of rules to decompose a semi-structured schema graph into S3-NF. The second approach uses the ER model to remove anomalies at the semantic level.
展开▼