A simultaneous semantic and structure threaded discussion modeling system and method for generating a model of a discussion thread and using the model to mine data from the discussion thread. Embodiments of the system and method generate a model that contains both semantic terms and structure terms. The model simultaneously models both semantics and structure of the discussion thread. A model generator includes a semantic module generates two semantic terms for the model and a structure module generates two structure terms for the model. The generator combines the two semantic terms and the two structure terms to generate the simultaneous semantic and structure model. Embodiments of the system and method include an applications module, which contains three application that use the model to reconstruct reply relations among posts in the discussion thread, identify junk posts in the discussion thread, and find experts in each sub-board of web forums.
展开▼