This paper presents the development of Croatian speech synthesis systems. Three voices were built using the same recorded speech corpus. Two of these voices were built with the Festival speech synthesis system, using the clustering unit selection method and the statistical parametric method. The third developed voice uses a general unit selection algorithm implemented in a custom speech synthesis system. Obtained voices are compared mutually and with voices generated with a previously developed diphone based TTS system. The comparison is based on subjective tests using MOS evaluation.
展开▼