SYNTHESIS OF FUNDAMENTAL FREQUENCY CONTOURS FOR STANDARD CHINESE BASED ON SUPERPOSITIONAL AND TONE NUCLEUS MODELS

Keikichi HIROSE; Qinghua SUN; Nobuaki MINEMATSU


Abstract

A method for generating sentence F_0 contours of Standard Chinese speech is developed. It is based on superposing tone components on phrase components in the logarithmic frequency domain. While tone components are language specific, phrase components are assumed to be more language universal. Taking this into account, the method treats the two kinds of components differently. The tone components are generated by concatenating F_0 patterns of tone nuclei, which are predicted by a corpus-based scheme, while the phrase components are generated by rules. Experiments on F_0 contour generation were conducted using 100 news utterances by a female speaker. The first experiments addressed the generation of tone components, with the phrase components of the original utterances used unchanged. The results showed that the method could generate F_0 contours close to those of the target speech. Speech was synthesized by replacing the original F_0 contours with the generated ones using TD-PSOLA. In listening experiments on the quality of the synthetic speech, a high average score of 4.5 on a 5-point scale was obtained. The second experiments addressed the generation of phrase components, with the tone components extracted from the original utterances. Although the synthetic speech with generated F_0 contours sounded mostly natural, there were occasional "degraded sounds" caused by a mismatch between the phrase and tone components. To cope with the mismatch, a two-step method was developed, in which information on the phrase contours is used for the prediction of tone components. The validity of the method was shown through perceptual experiments on synthesized speech.
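The superposition described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration: the phrase component uses the impulse response of a critically damped second-order system (the form familiar from command-response models), the tone component is a toy linear rise over one hypothetical tone nucleus, and the names `phrase_component` and `superpose_f0` and all parameter values are hypothetical. The rule-based and corpus-based predictors of the paper are not reproduced.

```python
import math


def phrase_component(t, onset, magnitude, alpha=3.0):
    """Hypothetical phrase component at time t (seconds).

    Assumed shape: magnitude * alpha^2 * dt * exp(-alpha * dt) for dt >= 0,
    i.e. the impulse response of a critically damped second-order system.
    The abstract does not specify this form.
    """
    dt = t - onset
    if dt < 0:
        return 0.0
    return magnitude * alpha * alpha * dt * math.exp(-alpha * dt)


def superpose_f0(base_hz, phrase, tone):
    """Add phrase and tone components to ln(base_hz); return F0 values in Hz."""
    log_base = math.log(base_hz)
    return [math.exp(log_base + p + c) for p, c in zip(phrase, tone)]


# 2 seconds of contour sampled every 10 ms (illustrative values only).
times = [i * 0.01 for i in range(200)]
phrase = [phrase_component(t, onset=0.0, magnitude=0.5) for t in times]
# Toy tone component: a linear rise over one hypothetical nucleus (0.5 s to 0.8 s).
tone = [0.3 * (t - 0.5) / 0.3 if 0.5 <= t <= 0.8 else 0.0 for t in times]
f0 = superpose_f0(200.0, phrase, tone)
```

Because the components are added in the logarithmic domain, each one acts as a multiplicative factor on the base frequency in Hz, so a tone component of ln 2 raises the contour by one octave regardless of the phrase level.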


Bibliographic record

Source
Archives of Acoustics, 2007, No. 1, pp. 41-50 (10 pages)
Authors
Keikichi HIROSE; Qinghua SUN; Nobuaki MINEMATSU
Affiliation
University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656 Japan
Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Acoustics
Keywords
speech synthesis; F_0 contour generation; Standard Chinese; superpositional model; tone nucleus model; rule- and corpus-based generation

