A computer-implemented method and system for incorporating emotional and contextual visualization into an electronic communication comprises: creating a 2D texture map of a user's face from a series of photos; comparing the user's 2D texture map with 2D texture maps of samples from a reference database to find the closest matches and create a photorealistic composite 3D mesh model of the user's head that can be modified to present different emotions and phonemes; during an electronic communication between a sending device of the user and a receiving device, determining an emotional state and a current phoneme (viseme) from the user's voice, text, or camera data, and transmitting an emotion identifier and a phoneme identifier to the receiving device; and using the emotion ID and phoneme ID to retrieve, from databases loaded on the receiving device, the corresponding 3D mesh models and 2D textures to create and display a fully animated video message on the receiving device without requiring video data transmission via the communication channel.
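The identifier-only transmission step can be illustrated with a minimal sketch. It assumes each frame is sent as a two-byte payload (one byte per identifier) and that the receiving device holds a local lookup table keyed by the identifier pair. The ID values, asset names, and function names below are hypothetical and are not taken from the source, which does not specify an encoding.

```python
import struct

# Hypothetical ID tables; the source does not define the actual identifiers.
EMOTIONS = {0: "neutral", 1: "happy", 2: "sad"}
VISEMES = {0: "rest", 1: "AA", 2: "OO"}


def build_asset_db():
    """Build a stand-in for the databases preloaded on the receiving device.

    Maps (emotion_id, phoneme_id) to the names of a 3D mesh model and a
    2D texture. Real entries would be mesh and texture data, not strings.
    """
    return {
        (e, p): {
            "mesh": f"mesh_{EMOTIONS[e]}_{VISEMES[p]}",
            "texture": f"tex_{EMOTIONS[e]}",
        }
        for e in EMOTIONS
        for p in VISEMES
    }


def encode_frame(emotion_id, phoneme_id):
    """Sender side: pack both identifiers into two unsigned bytes."""
    return struct.pack("BB", emotion_id, phoneme_id)


def decode_frame(payload):
    """Receiver side: unpack the two identifiers from the payload."""
    return struct.unpack("BB", payload)


def lookup_assets(db, payload):
    """Receiver side: resolve the identifiers to locally stored assets."""
    emotion_id, phoneme_id = decode_frame(payload)
    return db[(emotion_id, phoneme_id)]


if __name__ == "__main__":
    db = build_asset_db()
    payload = encode_frame(1, 2)  # "happy" emotion, "OO" viseme
    print(len(payload), lookup_assets(db, payload))
```

Under these assumptions only two bytes per frame cross the communication channel, while the mesh and texture data stay on the receiving device, which is the property the abstract describes as avoiding video data transmission.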