首页>
外国专利>
Speech analysis synthesis system and method thereof having the energy normalization and unvoiced frame suppression function
Speech analysis synthesis system and method thereof having the energy normalization and unvoiced frame suppression function
展开▼
机译:具有能量归一化和清音抑制功能的语音分析综合系统及其方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Energy normalization in speech synthesis systems is achieved by a look-ahead adaptive normalization procedure, wherein energy is adaptively tracked, and the adaptive energy-tracking value is used to normalize a much earlier frame's energy value. In another aspect, silence suppression in speech synthesis systems is achieved by detecting and processing only segments of voice activity. A segment is classified as "speech" if the energy of the signal is greater than an adaptively adjusted threshold. The adaptively adjusted threshold is preferably defined as the maximum of scaled values of two separate envelope parameters, which both track the variation in energy over the sequence of frames of speech data. One contour is a slow-rising fast-falling value, which is updated only during unvoiced speech frames, and therefore tracks a lower envelope of the energy contour. This parameter in effect tracks an ambient noise level. The other parameter is a fast-rising slow-falling parameter, which is updated only during voiced speech frames, and thus tracks an upper envelope of the energy contour. (This in effect tracks the average speech level.)
展开▼