PROBLEM TO BE SOLVED: To enable a lecturer to grasp the state of an audience more easily.
SOLUTION: A video acquisition unit 11 receives video captured by imaging an audience. A face detection unit 12 detects the faces of the people included in the video. A facial expression measurement unit 13, a line-of-sight measurement unit 14, and a nod measurement unit 15 measure the facial expression, line of sight, and nodding state of each detected face. An individual state estimation unit 165 estimates the state of each face on the basis of the measured facial expression, line of sight, and nodding state. A group state estimation unit 166 estimates the state of the whole audience on the basis of the state of each face. A consolidated avatar generation unit 17 generates an avatar on the basis of the state of the whole audience. An avatar presentation device 3 displays the generated avatar.
SELECTED DRAWING: Figure 2
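The processing flow in the abstract (per-face measurements, individual state estimation, group state estimation, avatar generation) might be sketched as below. This is only an illustrative sketch, not the method claimed in the patent: the class `FaceMeasurement`, the functions `estimate_individual_state`, `estimate_group_state`, and `generate_avatar`, the state labels, the 0.5 threshold, and the majority-vote rule are all assumptions introduced here, since the abstract does not specify how the states are computed.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class FaceMeasurement:
    """Hypothetical output of units 13, 14, and 15 for one detected face."""
    smile: float               # assumed facial-expression score in [0, 1]
    looking_at_lecturer: bool  # assumed line-of-sight result
    nodding: bool              # assumed nod-detection result


def estimate_individual_state(m: FaceMeasurement) -> str:
    """Assumed rule (stand-in for unit 165): count the positive cues."""
    cues = int(m.smile >= 0.5) + int(m.looking_at_lecturer) + int(m.nodding)
    if cues >= 2:
        return "engaged"
    if cues == 1:
        return "neutral"
    return "disengaged"


def estimate_group_state(states: list[str]) -> str:
    """Assumed rule (stand-in for unit 166): the most common individual state."""
    if not states:
        return "unknown"
    return Counter(states).most_common(1)[0][0]


def generate_avatar(group_state: str) -> str:
    """Assumed mapping (stand-in for unit 17) from group state to an avatar label."""
    avatars = {
        "engaged": "smiling_nodding",
        "neutral": "calm",
        "disengaged": "looking_away",
    }
    return avatars.get(group_state, "blank")


# Example: three detected faces, two of which show mostly positive cues.
faces = [
    FaceMeasurement(smile=0.8, looking_at_lecturer=True, nodding=True),
    FaceMeasurement(smile=0.2, looking_at_lecturer=True, nodding=True),
    FaceMeasurement(smile=0.1, looking_at_lecturer=False, nodding=False),
]
individual_states = [estimate_individual_state(f) for f in faces]
group_state = estimate_group_state(individual_states)
avatar = generate_avatar(group_state)
```

Collapsing the audience into a single label mirrors the abstract's idea of one consolidated avatar; a real system could instead weight the cues or smooth the estimate over time, which the abstract leaves open.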