首页>
外国专利>
Attention network based on duration information for text-to-speech analysis
Attention network based on duration information for text-to-speech analysis
展开▼
机译:关注网络基于文本到语音分析的持续时间信息
展开▼
页面导航
摘要
著录项
相似文献
摘要
The method and apparatus include receiving a text input comprising a sequence of text components. Each time duration of the text component is determined using a duration model. A first set of spectra is generated based on the sequence of text components. A second set of spectra is generated based on the first set of spectra and the respective temporal durations of the text component sequence. A spectrogram frame is generated based on the second set of spectra. The audio waveform is generated based on the spectrogram frame. An audio waveform is provided as an output.
展开▼