In this paper, we compare the distribution of disfluencies in two human-computer dialogue corpora. One corpus consists of unimodal travel booking dialogues, which were recorded over the telephone. In this unimodal system, all components except the speech recognition were authentic. The other corpus was collected using a semi-simulated multi-modal dialogue system with an animated talking agent and a clickable map. The aim of this paper is to anlayze and discuss the effects of modality, task and interface design on the distribution and frequency of disfluencies in these two corpora.
展开▼