Pitch-Asynchronous Overlap-Add Waveform-Concatenation Speech Synthesis by Using a Phase-Optimizing Neural Network


Abstract

The pitch-synchronous overlap-add (PSOLA) speech synthesis method has conventionally been used for high-quality waveform concatenation. It relies on the periodic structure of voiced speech, marked by pitchmarks. PSOLA-synthesized sound is of high quality as long as pitchmark detection succeeds, but it can degrade considerably when pitchmark detection fails or, more fundamentally, when the sound is an unvoiced consonant. In this paper, we propose a pitch-asynchronous waveform-concatenation speech synthesis method. It is based on adaptive phase optimization using complex-valued neural processing to maintain a desirable degree of pulse sharpness. Experimental results demonstrate successful generation of high-quality sound.
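As a rough illustration of what "pitch-asynchronous" overlap-add means, the sketch below cuts frames at a fixed hop (no pitchmark detection) and sums them back with a Hann window. It is only a minimal example under assumed parameters (8 kHz sampling, 200-sample frames, 50% overlap, and the hypothetical helper names `hann`, `analyze`, and `overlap_add`); it does not reproduce the paper's complex-valued neural phase optimization.

```python
import math

def hann(n):
    """Periodic Hann window of length n (sums to 1 at 50% overlap)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]

def analyze(signal, frame_len, hop):
    """Cut windowed frames at a fixed hop, ignoring pitchmarks."""
    w = hann(frame_len)
    return [
        [signal[s + i] * w[i] for i in range(frame_len)]
        for s in range(0, len(signal) - frame_len + 1, hop)
    ]

def overlap_add(frames, hop):
    """Sum equal-length frames placed every `hop` samples."""
    frame_len = len(frames[0])
    out = [0.0] * (hop * (len(frames) - 1) + frame_len)
    for k, frame in enumerate(frames):
        start = k * hop
        for i, sample in enumerate(frame):
            out[start + i] += sample
    return out

# Assumed demo signal: a 100 Hz sine sampled at 8 kHz (not from the paper).
fs = 8000
signal = [math.sin(2.0 * math.pi * 100.0 * n / fs) for n in range(800)]
frame_len, hop = 200, 100

frames = analyze(signal, frame_len, hop)
rebuilt = overlap_add(frames, hop)

# Away from the edges, every sample is covered by two windows that sum to 1,
# so the rebuilt signal matches the input there.
interior_error = max(abs(rebuilt[n] - signal[n]) for n in range(hop, len(signal) - hop))
print(len(frames), len(rebuilt), interior_error < 1e-9)
```

Here all frames come from one signal, so their phases already agree. When frames from different recorded units are concatenated at arbitrary positions, their phases generally do not line up, which is the situation the abstract's phase-optimization step is meant to handle.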


Bibliographic details

Source: International Conference on Knowledge-Based Intelligent Engineering Systems, 2003, 8 pages
Authors: Keiichi Tsuda; Akira Hirose
Series: Lecture Notes in Artificial Intelligence 2774
Original format: PDF
Chinese Library Classification: TP18-532

Similar literature

1. Pitch-asynchronous overlap-add waveform-concatenation speech synthesis by using a phase-optimizing neural network [J]. Keiichi Tsuda, Akira Hirose. 電子情報通信学会技術研究報告. ニューロコンピューティング. Neurocomputing. 2002, No. 729
2. Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models [J]. Zhen-Hua Ling, Zhi-Ping Zhou. Journal of VLSI signal processing systems for signal, image, and video technology. 2018, No. 7
3. Pitch-Asynchronous Overlap-Add Waveform-Concatenation Speech Synthesis by Using a Phase-Optimizing Neural Network [C]. Keiichi Tsuda, Akira Hirose. Lecture Notes in Artificial Intelligence 2774, International Conference on Knowledge-Based Intelligent Engineering Systems. 2003
4. Objective evaluation of speech quality over telecommunication networks using neural networks [D]. Meky, Mohamed Mohamed. 1998
5. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O]. Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez. 2012
6. Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks [O]. Valentini Botinhao, Cassia; Wang, Xin; Takaki, Shinji. 2016
