Provided are an audio output device, a head mounted display, an audio output method, and a program capable of improving the reality of augmented reality space and mixed reality space. The AR space image generation unit (116) is displayed on the display unit of the head mounted display, including a portion occupied by the image of the virtual reality space and a portion occupied by the image of the real space photographed by the camera for photographing the periphery of the user Create an augmented reality space or mixed reality space image. The synthetic speech generation unit (108) synthesizes the speech in the virtual reality space and the speech in the real space around the user to generate synthetic speech. The synthetic speech transmission unit (110) outputs synthetic speech emitted from a speaker disposed near the user's ear.
展开▼