Speech perception refers to the processes by which humans are able to interpret and understand the sounds used in language. Watching at lips and tongue movements of the speaker significantly improves the understanding of acoustic signal [1]. Computers generated 3D human head models, specified to animate synthesized or natural speech, are called Talking heads. They are playing considerably important role in human-computer communication and have caused significant scientific, technological and artistic interests in computer facial animation. Talking heads can be employed for e-consulting services: virtual secretary, WEB navigator or virtual agent who is responsible for information conveying to user in a Smart Ecological and Social Apartments (SESA) [2]. Also they are widely used in e-learning technologies as animated 3D models for the correct sound pronunciation presentation [3] or applied in movie, advertising and computer game industries. Most of the existing models are dedicated to animate English language, meanwhile there are proposed Talking heads for other languages [4, 5].
展开▼