The present invention relates to a method and device at speech-to-text conversion. From a given speech the fundamental tone is extracted. A model of the speech is further created from the speech. In the model a duration reproduction in words and sentences is obtained. The duration reproduction is compared with a segment duration in the speech. From the comparison is obtained information which decides which type of accent that exists, at which a text with sentence accent information is produced. IMAGE
展开▼