【24h】

Multi-Strategy Data Mining on Mandarin Prosodic Patterns

机译:普通话韵律模式的多策略数据挖掘

获取原文

摘要

Mandarin prosodic models are very important in speech research and synthesis, which mainly describes the variation of pitch. The models that are now being used in most Chinese Text-To=Speech systems. In this paper, we propose a Multi-stategy Data Mining framework to extract prosodic patterns from actual large Mandarin speech database to improve the naturalness and intelligibility of synthesized speech. In data preprocessing, typical prosody models are found by clustering analysis, and Rough Set is employed for feature selection. ANN and Decision tree are trained respectively. The prediction result of ANN and Decision Tree are integrated to generate fundamental frequency and energy contours. The experimental results showed that synthesized prosodic features quite resembled their original counterparts for most syllables.
机译:普通话韵律模型在语音研究和合成中非常重要,主要描述音高的变化。现在,大多数中文Text-To = Speech系统中都在使用这些模型。在本文中,我们提出了一种多策略数据挖掘框架,以从实际的大型普通话语音数据库中提取韵律模式,以提高合成语音的自然性和清晰度。在数据预处理中,通过聚类分析找到典型的韵律模型,并将粗糙集用于特征选择。人工神经网络和决策树分别训练。人工神经网络的预测结果与决策树相结合,以生成基本频率和能量等值线。实验结果表明,合成韵律特征与大多数音节的原始韵律特征非常相似。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号