首页> 外文期刊>Technical Gazette >Meta Learning Approach to Phone Duration Modeling
【24h】

Meta Learning Approach to Phone Duration Modeling

机译:元学习方法用于电话持续时间建模

获取原文
           

摘要

One of the essential prerequisites for achieving the naturalness of synthesized speech is the possibility of the automatic prediction of phone duration, due to the high importance of segmental duration in speech perception. In this paper we present a new phone duration prediction model for the Serbian language using meta learning approach. Based on the data obtained from the analysis of a large speech database, we used a feature set of 21 parameters describing phones and their contexts. These include attributes related to the segmental identity, manner of articulation (for consonants), attributes related to phonological context, such as segment types and voicing values of neighboring phones, presence or absence of lexical stress, morphological attributes, such as part-of-speech, and prosodic attributes, such as phonological word length, the position of the segment in the syllable, the position of the syllable in a word, the position of a word in a phrase, phrase break level, etc. Phone duration model obtained using meta learning algorithm outperformed the best individual model by approximately 2,0% and 1,7% in terms of the relative reduction of the root-mean-squared error and the mean absolute error, respectively.
机译:实现分段语音的自然性的基本前提之一是自动预测电话持续时间的可能性,这是因为分段持续时间在语音感知中具有很高的重要性。在本文中,我们使用元学习方法提出了一种针对塞尔维亚语的新电话持续时间预测模型。根据从大型语音数据库分析获得的数据,我们使用了21个参数的功能集来描述电话及其上下文。这些属性包括与片段身份相关的属性,发音方式(针对辅音),与语音环境相关的属性(例如相邻电话的片段类型和发声值,是否存在词汇重音,词法属性(例如词性的一部分)语音和韵律属性,例如语音字长,段在音节中的位置,字在音节中的位置,字在短语中的位置,短语中断级别等。就均方根误差和平均绝对误差的相对减少而言,元学习算法的性能优于最佳个体模型分别约2.0%和1.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号