A neural network based parameter system 118 is used for generating a virtual actor (visually-rendered model with speech 132) of which the movements are correlated with synthetic speech.Text 102, used to drive the virtual actor, is converted 104 to a linguistic representation of speech 106, which is converted to neural network linguistic parameters 110 by pre-processor 108. The neural network module 112 converts the neural network linguistic parameters into raw spatial parameters 114, which are finally converted into model parameters 120 by a post-processor 116. These model parameters 120 are then used to drive the virtual actors. Alternatively, a non-neural network based linguistics-to-speech module is used to convert the linguistic representation of speech 106. The speaker profile of a linguistics-to-speech module 126 provides data to change the characteristics of the alternatively-synthesized speech 128.
展开▼