首页> 外文期刊>ACM transactions on Asian language information processing >Stacking Model-Based Korean Prosodic Phrasing Using Speaker Variability Reduction and Linguistic Feature Engineering
【24h】

Stacking Model-Based Korean Prosodic Phrasing Using Speaker Variability Reduction and Linguistic Feature Engineering

机译:基于说话人变异性降低和语言特征工程的基于模型的韩国韵律短语堆叠

获取原文
获取原文并翻译 | 示例
       

摘要

This article presents a prosodic phrasing model for a general purpose Korean speech synthesis system. To reflect the factors affecting prosodic phrasing in the model, linguistically motivated machine-learning features were investigated. These features were effectively incorporated using a stacking model. The phrasing performance was also improved through feature engineering. The corpus used in the experiment is a 4,392-sentence corpus (55,015 words with an average of 13 words per sentence). Because the corpus contains speaker-dependent variability and such variability is not appropriately reflected in a general purpose speech synthesis system, a method to reduce such variability is proposed. In addition, the entire set of data used in the experiment is provided to the public for future use in comparative research.
机译:本文介绍了通用韩语语音合成系统的韵律短语模型。为了在模型中反映影响韵律短语的因素,对语言动机的机器学习功能进行了研究。使用堆叠模型有效地合并了这些功能。通过特征工程也改善了措词性能。实验中使用的语料库是4,392个句子的语料库(55,015个单词,每个句子平均13个单词)。由于语料库包含说话者相关的可变性,并且这种可变性在通用语音合成系统中没有得到适当反映,因此提出了一种减少这种可变性的方法。此外,实验中使用的全部数据还提供给公众,以供将来进行比较研究。

著录项

  • 来源
  • 作者单位

    Department of Computer Science and Engineering, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, Pohang, Gyeongbuk, 790-784, Republic of Korea;

    Department of Computer Science and Engineering, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, Pohang, Gyeongbuk, 790-784, Republic of Korea;

    Department of Computer Science and Engineering, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, Pohang, Gyeongbuk, 790-784, Republic of Korea;

    School of Computer and Information Communication Engineering, Catholic University of Daegu, Daegu, Republic of Korea;

    Department of Computer Science and Engineering, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, Pohang, Gyeongbuk, 790-784, Republic of Korea;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    prosodic phrasing; phrase break prediction; linguistic feature; stacking model; prosody; speech synthesis; korean;

    机译:韵律短语短语中断预测;语言特征堆叠模型韵律语音合成韩国人;
  • 入库时间 2022-08-17 13:41:40

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号