Various kinds of video recordings have discourse structures. Therefore, it is important to determine how video segments are combined and what kind of coherence relations they are connected with. We propose a method for estimating the discourse structure of video news reports.
展开▼