Much content on the World Wide Web is becoming tagged with simple words or phrases in natural language as web citizens create tags that organize information primarily to facilitate their personal retrieval and use, These tags represent, often incomplete, pieces of knowledge about concepts in a domain. Aggregated across a large number of contributors, these tags provide the potential to identify, in a bottom-up manner, key constructs in a domain. This research develops a set of heuristics that aggregate and analyze tags contributed by individual users on the web to extract and generate domain-level constructs. The heuristics infer the existence of constructs, and distinguish entities, attributes, and relationships.
展开▼