Meta Learning Approach to Phone Duration Modeling

Sovilj-Nikić Sandra; Sovilj-Nikić Ivan; Marković Maja


Abstract

Because segmental duration plays a major role in speech perception, automatic prediction of phone duration is an essential prerequisite for natural-sounding synthesized speech. In this paper we present a new phone duration prediction model for the Serbian language using a meta-learning approach. Based on data obtained from the analysis of a large speech database, we used a feature set of 21 parameters describing phones and their contexts. These include attributes related to segmental identity and manner of articulation (for consonants); attributes related to phonological context, such as the segment types and voicing values of neighboring phones and the presence or absence of lexical stress; morphological attributes, such as part of speech; and prosodic attributes, such as phonological word length, the position of the segment in the syllable, the position of the syllable in a word, the position of a word in a phrase, phrase break level, etc. The phone duration model obtained using the meta-learning algorithm outperformed the best individual model by approximately 2.0% and 1.7% in terms of the relative reduction of the root-mean-squared error and the mean absolute error, respectively.
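The two reported figures are relative reductions in error with respect to the best individual model. A minimal sketch of how such figures can be computed is given here; the duration values and the function names are illustrative assumptions and do not come from the paper.

```python
import math


def rmse(observed, predicted):
    """Root-mean-squared error between observed and predicted durations."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(observed, predicted):
    """Mean absolute error between observed and predicted durations."""
    absolute = [abs(o - p) for o, p in zip(observed, predicted)]
    return sum(absolute) / len(absolute)


def relative_reduction(baseline_error, new_error):
    """Percentage by which new_error is lower than baseline_error."""
    return 100.0 * (baseline_error - new_error) / baseline_error


# Hypothetical phone durations in milliseconds (not data from the paper).
observed = [60.0, 85.0, 110.0, 45.0]
best_individual = [70.0, 80.0, 100.0, 50.0]  # assumed best single model
meta_model = [68.0, 82.0, 102.0, 49.0]       # assumed meta-learning model

rmse_gain = relative_reduction(rmse(observed, best_individual),
                               rmse(observed, meta_model))
mae_gain = relative_reduction(mae(observed, best_individual),
                              mae(observed, meta_model))
print(f"Relative RMSE reduction: {rmse_gain:.1f}%")
print(f"Relative MAE reduction: {mae_gain:.1f}%")
```

With the toy numbers above the reductions are far larger than those in the abstract; only the calculation is meant to carry over.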

Bibliographic details

Source: Technical Gazette, 2018, Issue 3, 6 pages
Authors: Sovilj-Nikić Sandra; Sovilj-Nikić Ivan; Marković Maja
Original format: PDF
CLC classification: General industrial technology

Similar literature

1. Morpho-Phonetic Effects in Speech Production: Modeling the Acoustic Duration of English Derived Words With Linear Discriminative Learning [J]. Simon David Stein, Ingo Plag. Frontiers in Psychology, 2021.
2. A recommendation system for meta-modeling: A meta-learning based approach [J]. Cui Can, Hu Mengqi, Weir Jeffery D. Expert Systems with Applications, 2016.
3. Machine learning based approach to analyze file meta data for smart phone file triage [J]. Serhal Cezar, Le-Khac Nhien-An. Digital Investigation, 2021.
4. Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis [C]. Sai Krishna Rallabandi, Sai Sirisha Rallabandi, Padmini Bandi. IEEE Workshop on Automatic Speech Recognition and Understanding, 2015.
5. Three Research Topics in Education: (1) Associations between Approaches to Learning and Academic Achievement; (2) A Meta-Analytic Review on Approaches to Learning and Academic Achievement; (3) Power Analysis in Meta-Analysis: A Three-Level Model [D]. Zhang, Bixi. 2021.
6. Morpho-Phonetic Effects in Speech Production: Modeling the Acoustic Duration of English Derived Words With Linear Discriminative Learning [O]. Simon David Stein, Ingo Plag. 2021.
7. Utterance Verification Using Word Voiceprint Models Based on Probabilistic Distributions of Phone-Level Log-Likelihood Ratio and Phone Duration [O]. S.-B. Kwon, H. Kim. 2008.
