VOICE ANALYSIS/SYNTHESIZATION SYSTEM AND METHOD HAVING ENERGY NORMALIZING AND VOICELESS FRAME INHIBITING FUNCTIONS
Abstract
Energy normalization in speech synthesis systems is achieved by a look-ahead adaptive normalization procedure: energy is adaptively tracked, and the adaptive energy-tracking value is used to normalize the energy value of a much earlier frame. In another aspect, silence suppression in speech synthesis systems is achieved by detecting and processing only segments of voice activity. A segment is classified as "speech" if the energy of the signal is greater than an adaptively adjusted threshold. The threshold is preferably defined as the maximum of scaled values of two separate envelope parameters, both of which track the variation in energy over the sequence of frames of speech data. One parameter is a slow-rising, fast-falling value that is updated only during unvoiced speech frames and therefore tracks a lower envelope of the energy contour; in effect, it tracks the ambient noise level. The other is a fast-rising, slow-falling value that is updated only during voiced speech frames and thus tracks an upper envelope of the energy contour; in effect, it tracks the average speech level.
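The two mechanisms described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract gives no numerical values, so the smoothing rates (`slow`, `fast`, `rise`, `fall`), the scale factors (`lower_scale`, `upper_scale`), the look-ahead `delay`, and the `target` level are all hypothetical. The names `EnvelopeTracker` and `normalize_with_lookahead` are likewise invented for illustration, the first-order smoothing update is one plausible way to obtain slow/fast rise and fall, and the voiced/unvoiced flag is assumed to come from a separate voicing decision.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class EnvelopeTracker:
    """Two energy envelopes combined into an adaptive speech threshold."""
    lower: float                # slow-rising, fast-falling: ambient noise level
    upper: float                # fast-rising, slow-falling: average speech level
    slow: float = 0.05          # assumed smoothing rate for the slow direction
    fast: float = 0.5           # assumed smoothing rate for the fast direction
    lower_scale: float = 4.0    # assumed scale applied to the lower envelope
    upper_scale: float = 0.05   # assumed scale applied to the upper envelope

    def threshold(self) -> float:
        # Maximum of the scaled values of the two envelope parameters.
        return max(self.lower_scale * self.lower, self.upper_scale * self.upper)

    def update(self, energy: float, voiced: bool) -> bool:
        """Classify one frame, then update the envelope selected by `voiced`."""
        # Assumption: the frame is classified before the envelopes are updated.
        is_speech = energy > self.threshold()
        if voiced:
            # Upper envelope: rises quickly, falls slowly.
            rate = self.fast if energy > self.upper else self.slow
            self.upper += rate * (energy - self.upper)
        else:
            # Lower envelope: rises slowly, falls quickly.
            rate = self.slow if energy > self.lower else self.fast
            self.lower += rate * (energy - self.lower)
        return is_speech


def normalize_with_lookahead(energies, delay=8, rise=0.5, fall=0.02, target=1.0):
    """Normalize each frame's energy by a level tracked `delay` frames later."""
    level = None
    pending = deque()   # frames waiting for the tracker to look ahead
    out = []
    for e in energies:
        if level is None:
            level = e
        else:
            level += (rise if e > level else fall) * (e - level)
        pending.append(e)
        if len(pending) > delay:
            # The oldest frame is scaled by the level seen `delay` frames later.
            out.append(pending.popleft() * target / max(level, 1e-12))
    # Flush remaining frames using the final tracked level.
    while pending:
        out.append(pending.popleft() * target / max(level, 1e-12))
    return out
```

In this sketch a frame counts as speech only when its energy exceeds both the scaled noise floor and the scaled speech level, so the threshold rises in noisy conditions and also follows the loudness of the talker.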